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I .  INTRODUCTION 


Practical  decisions  are  invariably  made  under  conditions  of  uncertainty;  i.e. 
short  of  complete  information.  Indeed,  decisions  (whether  large  or  small)  are  seldom 
made  with  complete  or  perfect  information.  Intelligent  decision  making  obviously 
requires  the  availability  of  good  information;  the  proper  analysis  of  this  information, 
of  course,  is  equally  important.  In  the  case  of  decision  situations  involving  uncertainty, 
the  necessary  information  may  have  to  be  expressed  in  terms  of  probability;  its  analysis 
would  naturally  require  probabilistic  concepts  and  modeling.  Moreover,  the  criterion  or 
basis  for  decision,  or  choice  of  action,  may  have  to  be  defined  statistically,  for 
example,  on  the  basis  of  maximum  expected  utility. 

Often,  the  judgment  and  preference  of  a  decision  maker  have  an  overriding  influence 
on  the  final  choice  of  action.  In  some  cases,  no  amount  of  formal  analysis  can  really 
influence  the  intuitive  bias  of  a  decision  maker.  Nevertheless,  under  conditions  of 
uncertainty,  and  in  complex  situations,  there  are  certain  analytical  tools  that  may  be 
pertinent  and  useful  for  systematically  synthesizing  available  information  (even  though 
incomplete)  and  subjective  judgment,  taking  into  consideration  any  preference  or  bias 
of  the  decision  maker.  The  concepts  and  tools  of  statistical  decision  theory  provide 
the  framework  for  these  purposes. 

A  formal  statistical  decision  analysis  will  depend  on  the  acceptability  of  the 
decision  basis,  i.e.  decision  criterion,  as  well  as  on  the  definition  and  measure 
of  consequence  such  as  a  utility  function,  used  in  the  process.  The  implementation  of 
the  decision  concepts  summarized  herein,  therefore,  necessarily  requires  the  establish¬ 
ment  of  a  realistic  utility  function  appropriate  for  a  given  problem;  for  this  reason, 
the  application  of  these  concepts  in  real  decision  problems  must  be  problem-specific. 

In  practice,  decisions  invariably  involve  a  choice  of  an  action  from  among  a  finite 
number  of  feasible  alternatives.  A  formal  decision  analysis,  therefore,  must  include 
the  evaluation  of  the  consequence  or  payoff  associated  with  each  of  the  feasible 
alternatives. 

In  short,  decisions  can  be  made  with  or  without  any  formal  analysis;  the  intuition 
and  judgment  of  a  decision  maker  often  have  overriding  influence  on  his  choice  of  altern¬ 
atives.  Under  conditions  of  uncertainty,  however,  statistical  decision  theory  provides 
a  systematic  framework  on  which  the  decision  making  process  may  be  intelligently  carried 
out.  This  involves  the  analysis  of  available  information,  including  judgmental  infor¬ 
mation,  and  the  assessment  of  the  potential  consequences  of  each  feasible  alternative 
action.  Perfect  decisions  (i.e.  free  of  the  possibility  of  a  wrong  action)  should  not 
be  expected  even  with  an  elaborate  statistical  analysis.  A  formal  decision  making 
process  only  provides  the  framework  for  systematically  analysing  all  available  informa¬ 
tion  (including  subjective  judgments)  and  for  consistently  and  objectively  evaluating 
the  potential  consequences,  taking  into  consideration  any  preferential  bias  of  the 
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decision  maker.  Hopefully,  the  process  will  minimize  the  probability  of  a  wrong  choice 
of  alternative;  at  the  very  least,  it  may  provide  additional  insight  on  the  problem. 

The  basic  concepts  of  statistical  decision  theory  are  presented;  also,  the 
potential  roles  and  applications  of  these  concepts  in  strategic  problems  are  emphasized. 
Implementation  of  statistical  decision  concepts,  however,  is  seldom  straight-forward. 
Invariably,  innovative  formulations  and  modeling  of  the  specific  physical  problem  will 
be  necessary;  examples  of  these  are  illustrated  with  special  reference  to  strategic 
defense  problems.  As  the  numerical  results  will  require  extensive  computer  calcula¬ 
tions,  the  illustrations  are  limited  to  the  discussions  of  the  principles  involved  and 
formulations  of  the  necessary  models. 


6 


\ 


II.  BASIC  STATISTICAL  DECISION  CONCEPTS 

2.1  The  Decision  Model 

Formally,  a  statistical  decision  model  may  be  defined  in  terms  of  four  major  com¬ 
ponents,  as  follows: 

1.  The  feasible  alternatives. 

2.  The  possible  outcomes  associated  with  each  alternative. 

3.  The  probabilities  associated  with  each  possible  outcome. 

4.  The  consequence  associated  with  each  alternative  and  a  given  outcome. 

All  of  these  components  may  be  portrayed  conveniently  in  the  form  of  a  decision  tree. 
Figure  2.1  illustrates  an  example  of  a  decision  tree  applied  to  a  hypothetical  target¬ 
ing  decision  problem.  The  decision  tree  starts  out  with  a  decision  node  (square  node 
in  Fig.  2.1),  at  which  point  the  decision  maker  identifies  the  various  feasible  altern¬ 
atives.  In  the  case  of  the  example  problem  of  Fig.  2.1,  the  choices  are 

A^  =  deploy  a  single  large  weapon 
-  deploy  several  small  weapons 

A^  =  acquire  further  intelligence  data  on  the 
enemy's  site  condition 

For  each  alternative,  there  is  a  chance  node  (circular  node  in  Fig.  2.1)  which  is 
followed  by  several  branches  denoting  a  set  of  possible  outcomes  associated  with  this 
particular  action.  In  this  example,  the  uncertainty  lies  in  the  site  condition  of  the 
enemy's  facility,  which  is  assumed  (for  simplicity)  to  be  either  soft  soil  (0^)  or 
hard  rock  (e^).  Presumably,  a  single  large  weapon  would  be  more  effective  in  destroying 
a  hard  rock  site, whereas  several  small  weapons  will  be  more  cost-effective  for  a  soft 
soil  condition.  In  alternatives  A^  and  A^  the  probabilities  of  encountering  soft  soil 
or  hard  rock  may  be  assessed  using  available  geological  information  combined  with  judg¬ 
ment;  these  are  indicated  at  the  respective  branches  as  0.3  and  0.7  in  Fig.  2.1.  In 
this  example,  these  probabilities  are  assumed  to  be  invariant  with  weapon  types.  Observe 
that  the  outcomes  at  each  chance  node  are  mutually  exclusive  and  collectively  exhaustive; 
consequently,  the  probabilities  associated  with  all  the  possible  outcomes  at  a  chance 
node  should  sum  up  to  unity. 

A  third  alternative,  A^,  is  included  in  the  decision  tree  of  Fig.  2.1,  which 
involves  acquiring  futher  intelligence  data  on  the  enemy's  site  condition.  Depending 
on  whether  the  intelligence  data  indicates  soft  soil  (z^)  or  hard  rock  (z^) ,  the  decision 
maker  will  have  to  make  a  decision  at  node  B  as  to  which  weapon  system  to  use.  Since 
the  intelligence  data  may  be  subject  to  error,  the  possible  outcomes  of  the  subsequent 
branches  will  still  consist  of  9^  and  However,  the  probabilities  of  6^  and  will 

obviously  depend  on  whether  the  intelligence  data  favors  one  or  the  other  outcome.  For 
example,  if  intelligence  data  indicates  soft  soil  (i.e.  r.,),  the  probability  of  actually 
encountering  soft  soil  at  the  site  will  be  high,  but  not  necessarily  equal  to  1.0. 
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Figure  2.1  A  decision  tree  example 


Therefore,  conditional  probabilities  are  required  here  to  account  for  the  reliability 
of  the  intelligence  information. 

The  consequence  associated  with  each  "path"  (i.e.  a  given  action  combined  with 
a  specific  outcome)  may  be  measured  in  terms  of  a  "utility".  In  general,  each  path 
will  have  its  own  utility  measure  as  indicated  in  Fig.  2.1.  The  utilities  for  the 
various  paths  in  Fig. 2.1  were  evaluated  with  the  procedure  outlined  in  Appendix  A. 

The  basic  decision  in  the  decision  tree  of  Fig.  2.1  is  between  alternatives  A^ 
and  A^.  There  are,  therefore,  essentially  four  paths  that  must  be  ranked  in  order. 

The  objective  of  a  targeting  decision,  of  course,  is  to  destroy  an  enemy  installation 
with  the  optimal  weapon.  In  this  regard,  assume  that  a  number  of  small  weapons  will 
have  maximum  destruction  if  the  site  of  the  enemy's  installations  is  of  soft  soil, 
whereas  using  a  large  weapon  on  a  soft-soil  site  will  result  in  overkill.  With  this 
consideration,  the  different  possibilities  are  ranked  in  order  of  preference  as  shown 
in  Fig.  2.1,  such  that  ?  >  E^,  in  which  the  symbol  >  means  "is  preferred 

to". 

A  relative  utility  value  of  100  is  then  assigned  to  E^  and  0  to  E^;  i.e., 

u(El)  =  100 
u(E4)  =  0 

The  relative  utility  values  for  E2  and  are  then  evaluated  by  the  lottery  schemes 
of  Appendix  A  as  follows: 

The  objective  here  is  to  determine  the  relative  utility  values  for  E ^  and  E^  in 
such  a  manner  that  consistency  is  maintained.  For  this  purpose,  consider  two  lotteries 
in  one  there  is  certainty  of  obtaining  E2,  whereas  in  the  other  there  is  a  probability 
p  of  obtaining  E^  and  corresponding  probability  (1-p)  of  obtaining  E^ .  Clearly,  if 
p  =  1.0,  the  second  lottery  would  be  preferred;  whereas,  if  p  =  0  the  first  lottery 
would  be  preferred.  Therefore,  at  some  probability  p,  the  two  lotteries  would  be 
indifferent  to  the  decision  maker.  Suppose  that  this  is  at  p  =  0.6  and  (1-p)  =  0.4, 
as  shown  in  the  following  sketch. 


The  utility  value  for  E2,  therefore,  is 

u(E2)  “  P  u(E1)  +  (1-p)  u(E4) 

=  (0.6  x  100)  +  (0.4  x  0) 

=  60 

Similarly,  the  indifference  lotteries  for  determining  the  utility  for  may  be 
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On  the  basis  of  the  latter  lottery,  the  corresponding  utility  value  for  E^  is. 


u(E3)  =  p  u(E1)  +(l-p)  u(E4) 

=  (0.5  x  100)  +  (0.5  x  0) 

=  50 

It  may  be  observed  that  the  relative  utilities  between  E2  and  E^  are  consistent 
in  accordance  with  the  preference  ranking  of  E2  and  E^.  However,  consistency  must  be 
further  verified  relative  to  the  utility  values  of  and  E^ .  For  this  purpose, 
consider  other  pairs  of  lotteries;  for  example,  a  lottery  with  certainty  for  E2  and 
another  lottery  with  probabilities  p  and  (1-p)  of  obtaining  E^  and  E^,  respectively, 
as  follows: 


In  this  case,  suppose  that  the  indifference  probability  is  p  =  0.55  between  the  two 
lotteries.  Then,  the  new  utility  value  for  E2  becomes 

u'(E2)  -  (0.55  x  100)  +  (0.45  x  50)  =  77.5 

Since  u(E2)  /  u'(E2),  there  is  inconsistency  in  the  assignment  of  relative  utilities 
up  to  this  point. 

The  entire  process  may  be  revised  using  the  following  indifference  lotteries. 
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On  the  basis  of  the  above  lotteries, 

u(E2)  =  (0.8  x  100)  +  (0.2  x  0)  =  80 

u(E3)  =  (0.5  x  100)  +  (0.5  x  0)  =  50 

Again,  to  verify  consistency  consider  the  following  indifference  lottery: 


The  new  utility  value  for  E2  then  becomes 

u'(E2)  =  P  “(Ej)  +  (!-p)  u(Ej) 

=  (0.6  x  100)  +  (0.4  x  50) 

=  80 

Therefore,  u(E2)  =  u'(E2).  Hence,  complete  consistency  has  been  achieved. 

The  utility  values  for  the  four  paths,  therefore,  are: 
u(E  )  =  100 
u(E2)  =  80 
u(E3)  *  50 
u(E4)  =  0 

These  utility  values  are  also  shown  in  Fig.  1.1  for  the  first  four  paths  corresponding 
to  alternatives  and  A2- 

With  regard  to  the  paths  associated  with  alternative  A3>  the  relative  utility 
values  should  include  the  cost  of  acquiring  the  additional  intelligence  information. 

A  cost  of  five  utility  units  is  assumed  for  this  purpose.  On  this  basis,  the  relative 
utility  values  for  each  of  the  paths  associated  with  alternative  A3  can  be  obtained 
by  deducting  five  units  from  the  utility  values  of  the  corresponding  paths  associated 
with  and  A2  obtained  earlier.  The  results  are  then  those  shown  in  Fig.  2.1. 

Generalizations  —  The  exact  configuration  of  a  decision  tree  will  depend  on  the 
specific  problem.  Some  trees  may  have  many  alternatives  whereas  others  may  have  many 
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different  possible  outcomes.  Moreover,  Instead  of  discrete  branches  representing  the 
possible  outcomes  from  a  given  action,  a  continuous  spectrum  of  outcomes  may  sometimes 
originate  from  a  chance  node,  such  as  the  possible  outcomes  defined  by  the  value  of  a 
continuous  random  variable.  An  alternative  could  also  consist  of  a  sequence  of  actions 
and  feedbacks,  such  that  the  outcome  following  an  earlier  action  would  affect  the 
decision  at  a  subsequent  stage.  This  latter  analysis  is  called  a  "sequential  decision" 
process  and  may  be  represented  as  a  series  of  decision  nodes,  branches,  chance  nodes, 
etc.  in  a  decision  tree. 

2.2  Decision  Criteria 

A  formal  decision  analysis  requires  a  criterion  for  choice;  i.e.  a  rule  for 
determining  what  constitutes  a  "best  decision".  In  this  regard,  it  may  be  observed 
that  the  pay-off  values  (i.e.  utilities)  associated  with  each  path  depend  on  the 
particular  outcome  which  are  generally  not  known  deterministically;  otherwise,  the 
decision-maker  would  obviously  choose  the  alternative  with  the  highest  utility  value. 
Having  constructed  and  completed  the  decision  tree,  there  will  be  multiple  utility 
values,  u^,  U£,  ...»  u^,  each  associated  with  a  particular  set  of  action  and  outcome, 
and  the  corresponding  probabilities  p^,  p^,  ....  p^,  respectively.  This  is,  of  course, 
a  characteristic  of  decisions  under  conditions  of  uncertainty.  Depending  on  the 
temperament,  experience,  and  degree  of  risk  aversiveness  of  the  decision  maker,  the 
"best  decision"  may  mean  different  things  to  different  decision-makers,  and  at  different 
times.  Nevertheless,  certain  criteria  for  decisions  may  be  described  as  follows: 

1.  Mini-Max  Criterion  —  This  criterion  would  suggest  that  the  best  decision  is 
the  alternative  that  will  minimize  the  decision-maker's  maximum  possible  loss  (or 
negative  utility).  In  this  case,  the  decision-maker  would  never  venture  into  anything 
that  may  give  him  substantial  positive  utility  (even  though  with  high  probability)  as 
long  as  there  is  a  finite  chance  for  a  substantial  loss.  This  is  a  pessimistic 
approach. 

2.  Maxi-Max  Criterion  —  With  this  criterion,  the  best  decision  will  be  to  select 
the  alternative  that  will  maximize  the  decision-maker's  maximum  possible  gain  among  the 
alternatives,  irrespective  of  the  probability  of  achieving  this  maximum  gain.  Basically, 
this  is  an  optimistic  approach. 

The  mini-max  and  maxi-max  approaches  suggest  that  decisions  are  to  be  made  solely  on 
the  basis  of  the  utility  values;  in  particular,  both  approaches  ignore  the  probability 
values  associated  with  each  of  the  utility  values.  As  a  consequence,  a  considerable 
amount  of  Information  is  wasted;  moreover,  in  the  long  run  both  the  mini-max  and  maxi- 
max  criteria  may  accumulate  substantial  losses. 

3.  Maximum  Expected  Monetary  Value  Criterion  —  If  the  attributes  associated  with 
each  path  in  a  decision  tree  can  be  expressed  purely  in  terms  of  monetary  values,  the 
monetary  values  of  various  branches  for  an  alternative  could  be  weighted  by  their 


respective  probabilities  to  obtain  the  expected  monetary  value  of  the  alternative;  thus 


E(dt) 


1  p.  -d. . 
j  ij  iJ 


(2.1) 


where  d^  denotes  the  monetary  value  of  the  j 


.  th 


branch  associated  with  alternative  i,  and 
The  best  decision  then  is  to  choose  the  alterna¬ 
tive  with  the  maximum  expected  monetary  value;  i.e. 


p^j  is  the  corresponding  probability 


E(d  ) 
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Max  (£  p  .  ,d .  .  ) 
,  ,  iJ  iJ 


(2.2) 


4.  Maximum  Expected  Utility  Criterion  —  In  the  event  that  monetary  value  is  not 
the  only  measure  of  value  (as  in  the  case  of  a  risk  aversive  or  risk  affinitive  decision¬ 
maker),  or  if  the  attributes  describing  the  value  of  each  path  in  a  decision  tree 
involves  non-monetary  attributes,  a  more  general  approach  would  be  to  use  utility  u^ 
to  denote  the  consequence  associated  with  a  given  path.  In  such  a  case,  the  optimal 
decision  would  be  to  choose  the  alternative  with  the  maximum  expected  utility;  i.e. 


E(uopt)  =  PijUij)  (2‘3) 

where  u_  is  the  utility  of  the  jth  branch  associated  with  alternative  i. 

It  may  be  emphasized  that  the  utility  approach  described  above  already  accounts 
for  the  risk  aversiveness  of  most  decision-makers  for  avoiding,  whenever  possible, 
disastrous  outcomes;  this  is  accomplished  through  the  assignment  of  the  appropriate 
utility  value  or  developing  the  appropriate  utility  function.  Therefore,  based  on 
the  maximum  expected  utility  criterion,  the  decision-maker  chooses  the  alternative  that 
would  be  reasonably  acceptable  over  a  wide  range  of  possible  outcomes,  by  properly 
balancing  the  true  value  (gain  or  loss)  of  various  potential  outcomes  against  their 
probabilities  of  occurrence. 


2.3  Analysis  of  a  Decision  Tree 

A  basic  decision  tree  consists  of  one  decision  node,  such  as  that  shown  in  Fig.  2.2. 
If  the  maximum  expected  utility  criterion  is  used,  the  expected  utility  value  of  each 
alternative  is  first  computed  according  to  Eq.2.3ateach  chance  node,  from  which  the 
alternative  with  the  maximum  expected  utility  value  is  the  optimal  alternative. 

Sometimes,  a  second  or  subsequent  decision  is  required  depending  on  the  outcome 
of  the  first  decision;  in  such  cases,  there  will  be  more  than  one  decision  node  in  a 
decision  tree.  An  example  is  shown  in  Fig.  2. 3. In  this  case,  the  analysis  starts 
from  the  right  by  first  identifying  the  sub-trees,  such  as  B  and  C  as  shown  in  Fig.  2.3. 
The  expected  value  of  each  alternative  in  all  the  sub-trees  are  subsequently  computed. 
Assuming  that  the  maximum  expected  utilities  are  u  and  u  ,  respectivelv ,  for  sub-trees 
B  and  C,  the  corresponding  optimal  alternative  may  then  be  identified.  this  point. 
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the  decision  tree  may  be  simplified  as  shown  in  Fig.  2.4,  where  u*  and  u* 

B  C 

represent  the  utilities  of  the  entire  sub-trees  B  and  C,  respectively.  The  expected 
utility  of  each  alternative  at  node  A  may  then  be  computed  and  the  optimal  action 
selected  accordingly. 

In  a  large  decision  tree,  where  several  decision  nodes  could  appear  in  a  single 
branch,  as  in  the  case  of  sequential  decision  analysis,  the  process  of  sub-tree  analysis 
will  be  repeated  several  times,  moving  from  right  to  left  and  advancing  from  one 
decision  node  at  a  time.  Once  this  backward  analysis  is  completed,  the  decision  process 
will  start  from  the  left.  First,  the  optimal  alternative  is  selected  at  stage  1; 
depending  on  the  specific  outcome  of  this  first  action,  the  process  moves  to  stage  2. 

From  the  results  of  the  backward  analysis,  the  optimal  alternative  at  each  decision  node, 
presumably  has  been  identified  (as  marked  in  bold  lines  in  Fig.  2.5).  In  particular,  it 
involves  the  predetermined  optimal  alternative  at  the  decision  node  in  this  stage 
(namely,  stage  2).  Again,  depending  on  the  specific  outcome  at  the  chance  node  at  this 
stage,  the  decision  process  moves  on  to  stage  3,  at  which  point  another  predetermined 
optimal  alternative  may  be  selected. 

Four  types  of  analysis  may  be  involved  in  a  formal  statistical  decision  analysis; 


name ly : 

(1)  Prior  Analysis  —  Literally,  a  prior  analysis  involves  the  calculation  of  the 
probabilities  and  utility  values  for  each  path  in  a  decision  tree  based  on  existing 
information  and  prior  assumptions.  Any  alternative  requiring  the  acquisition  of 
additional  information,  therefore,  will  not  be  included  in  a  prior  analysis. 

(2)  Terminal  Analysis  —  This  involves  the  analysis  and  re-evaluation  after  the 
necessary  experiments  or  data-gathering  schemes  have  been  performed  and  completed.  The 
probabilities  of  the  possible  outcomes  are  updated  accordingly;  the  remaining  analysis 
are  then  similar  to  those  of  the  prior  analysis. 

(3)  Preposterior  Analysis  —  This  primarily  addresses  the  question  "should 
additional  Information  need  to  be  gathered?",  and  if  so  which  scheme  should  be  used. 

The  decision  tree  shown  in  Fig.  2.3  is  typical  of  problems  involving  preposterior 


analysis. 

(4)  Sequential  Analysis  —  This  consists  of  several  decision  stages;  at  each  stage, 
a  decision  is  made  on  the  basis  of  the  observed  results  of  the  previous  stage.  A 
typical  decision  tree  for  this  analysis  is  shown  in  Fig.  2.5. 

The  general  approach  for  each  of  the  various  analyses  described  above  have  been 
outlined  in  the  previous  sections. 


Example 

The  decision  tree  shown  in  Fig.  2.1  requires  prior  as  well  as  preposterior 
analyses;  the  latter  is  associated  only  with  alternative  A^  which  requires  the 
acquisition  of  additional  intelligence  information  before  selecting  an  action,  whereas 
the  analysis  associated  alternatives  A^  and  A^  are  strictly  prior  analysis. 
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Stage  2 


Stage  3 


Figure  2.5  Sequential  decision  analysis 


Based  on  the  probability  and  utility  values  shown  in  the  decision  tree  of  Fig.  2.1, 
which  are  based  on  existing  information  and  initial  assumptions,  the  expected  utilities 
for  alternatives  and  are,  respectively,  as  follows: 

E(U„  )  =  0.3  x  50  +  0.7  x  80  =  71 
A1 

EOT  )  =  0.3  x  100  +  0.7  x  0  =  30 

a2 

If  no  further  intelligence  information  were  to  be  gathered,  the  maximum  expected 
utility  criterion  would  suggest  using  alternative  A^  (i.e.  deploying  a  single  large 
weapon  in  the  attack). 

However,  if  the  acquisition  of  additional  intelligence  information  is  a  viable 
option  available  in  the  decision  process,  the  preposterior  analysis  for  alternative  A^ 
would  proceed  from  the  right  of  the  decision  tree  as  follows: 

If  the  additional  intelligence  data  is  (favoring  9^),  the  probabilities  are 
calculated  as  follows: 

E(U  |z  )  =  0.632  x  55  +  0.368  x  75  =  62.36 
A1  1 

E(Ua  |zl)  =  0.632  x  95  +  0.368  x  (-5)  =  58.20 
whereas  if  is  the  outcome  of  the  additional  intelligence  gathering,  then 

E(U  |z  )  =  0.097  x  55  +  0.903  x  75  =  73.06 
A1  l 

E(U,  |z„)  =  0.097  x  95  +  0.903  x  (-5)  =  4.70 
a2  2 

implying,  therefore,  that  alternative  A^  (using  a  single  large  weapon)  would  be  the 
preferred  action  at  node  B  of  the  decision  tree  irrespective  of  the  outcome  of  the 
additional  intelligence  gathering.  Therefore,  the  expected  utility  for  action  A^  is 
(refer  to  Fig.  2.1) 

E(U  )  =  0.38  x  62.36  +  0.62  x  73.06  =  68.99 
A3 

Since  E(UA  )  is  slightly  less  than  the  expected  utility  of  A^,  the  optimal  decision, 
therefore, ^is  to  use  a  single  large  weapon  in  the  target  plan.  In  this  example,  the  cost 
of  intelligence  gathering  was  assumed  to  be  -5  utility  units;  according  to  the  above 
analysis  the  additional  information  is  not  worth  the  cost.  Of  course,  if  the  cost  of 
intelligence  is  much  less  than  -5  utility  units,  then  it  may  be  worthwhile  to  acquire 
additional  intelligence  information  before  choosing  a  weapon  system;  i.e.  alternative 
A^  may  become  optimal. 


2  .A  Procedure  and  Requirements  In  Implementation 

As  indicated  earlier,  a  statistical  decision  analysis  will  consist  of  the  follow¬ 
ing  components:  (1)  the  identification  of  the  feasible  alternative  actions  that 
require  further  analysis;  (2)  the  identification  of  the  possible  outcomes  associated 
with  a  given  action;  (3)  the  assessment  of  the  probability  of  occurrence  of  each  of  the 
possible  outcomes;  (A)  the  evaluation  of  the  utility  associated  with  each  path  (i.e.  an 
action  and  a  given  outcome). 

Identification  of  Alternatives  —  In  identifying  the  feasible  alternatives  for 
further  decision  analysis,  one  should  not  simply  exhaustively  list  all  the  possible 
actions  that  may  be  available.  Often,  there  are  options  that  are  clearly  not  practically 
feasible  or  viable;  such  possible  actions  are  therefore  *- --t  feasible  alternatives.  By 
eliminating  the  obviously  infeasible  alternatives,  Lne  feasible  alternative  actions  are 
usually  limited  in  number;  these  are  the  ones  tha*-  require  more  careful  analysis. 

Among  the  feasible  alternatives,  there  may  he  those  that  require  the  acquisition 
of  additional  information  prior  to  a  final  decis*'  •.  tn  this  latter  case,  the  additional 
information  may  involve  the  use  of  consultar  ,  or  ( iie  performance  of  a  field  or 
laboratory  test;  the  lattev  may  also  involve  decisions  on  the  choice  of  a  test  plan. 

In  the  case  of  a  sequential  or  staged  decis t^n  process,  the  alternatives  at  a 
subsequent  stage  may  be  different  from  those  of  the  earlier  stage.  As  new  information 
and  developments  are  made  available,  the  available  alternatives  may  change  —  the 
feasible  alternatives  at  an  earlier  stage  may  become  infeasible  at  a  later  stage; 
conversely,  other  possible  actions  which  were  not  feasible  at  an  earlier  stage,  may 
become  feasible  at  a  later  stage. 

The  Possible  Outcomes  —  In  a  well-defined  problem,  the  possible  outcomes  result¬ 
ing  from  a  given  action  should  be  exhaustively  identified.  Oftentimes,  however,  the 
possible  outcomes  may  not  be  obvious;  for  example,  they  may  depend  on  the  opponent's 
strategy  which  may  not  be  fully  revealed  until  it  is  too  late.  In  such  cases,  of 
course,  the  exercise  of  sound  judgment  in  the  identification  of  the  possible  outcomes 
is  important. 

In  the  event  that  the  occurrence  of  a  set  of  possible  outcomes  is  controlled  by 
another  process  which  may  be  random,  an  additional  chance  node  could  precede  the 
first  one,  denoting  the  two  levels  of  outcomes  possible  in  this  case.  As  an  example, 
the  parameters  of  a  probability  distribution  of  a  random  variable  may  be  unknown; 
thence,  the  probability  of  occurrence  of  a  specific  value  of  the  random  variable  will 
depend  on  the  value  of  the  parameter  which  is  itself  a  random  variable. 

Assessment  of  Probabilities  —  Clearly,  if  more  than  one  outcome  is  possible 
subsequent  to  a  given  action,  the  probabilities  of  occurrence  of  the  various  outcomes 
are  important.  It  may  be  emphasized  that  the  probabilities  are  important  to  denote 
the  relative  likelihoods  of  occurrence  of  the  various  possible  outcomes,  and  therefore 
is  the  main  characteristic  of  decision  making  under  conditions  of  uncertainty.  The 
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required  probabilities  are  necessarily  estimated  or  calculated  values,  which  may  be 
evaluated  on  the  following  basis: 

(1)  Based  entirely  on  past  observed  data —  In  this  instance,  the  probabilities 
of  future  occurrences  of  the  possible  outcomes  will  be  based  purely  on  the  statistical 
observations  of  the  past.  The  validity  of  this  approach  will  require  a  large  set  of 
observed  data,  which  is  very  rarely  the  case  in  practice. 

(2)  Based  on  subjective  judgment  of  the  decision  maker  —  In  this  case,  the 
estimated  probabilities  are  purely  subjective;  consequently,  rigorous  justification  of 
such  probabilities  will  obviously  be  difficult.  The  validity  of  such  probabilities 
must  rely  on  the  quality  or  credibility  of  the  judgment  of  the  decision  maker. 

(3)  Based  on  the  combination  of  observed  data  and  subjective  judgment  —  The 
Bayes'  theorem  is  the  vehicle  for  combining  the  two  types  of  information. 

Specific  methods  have  been  suggested  for  extracting  subjective  probabilities; 
these  include  the  following: 

(i)  Use  of  a  probability  wheel  (Spetzler  and  Stael  von  Holstein,  1972)  and 
repetitive  questioning  until  the  indifference  of  two  lotteries  is  achieved. 

(ii)  Establishing  the  subjective  distribution  of  a  random  variable  by  the  fractile 
method  (Raiffa,  1968;  Schlaefer,  1969),  in  which  the  range  of  a  random  variable  is 
divided  into  two  equally  likely  halves,  each  of  which  is  then  subdivided  further;  the 
process  continues  until  the  required  accuracy  of  the  probability  calculations  in  the 
distribution  tail  regions  is  achieved. 

(iii)  The  probability  distribution  over  a  given  range  may  be  assumed;  this  could 
include  the  uniform  and  triangular  distribution,  the  latter  distribution  could  be 
specified  with  various  degrees  of  skewness  to  reflect  the  relative  likelihoods  over 
the  range. 

(iv)  Finally,  the  Delphi  method  may  be  used  if  the  subjective  judgments  and 
opinions  from  a  group  of  experts  are  available.  Essentially,  this  involves  the 


repeated  up-dating  of  individual  opinions  and  estimates  in  light  of  the  opinions  and 
judgments  from  the  others  in  the  group.  In  addition  to  the  subjective  assessment  of 
probabilities,  the  Delphi  method  may  be  used  also  for  evaluating  and  assigning  utility 
values . 

In  order  to  combine  subjective  judgment  with  available  data,  the  probabilities 
initially  obtained  on  the  basis  of  judgment  may  be  updated  in  light  of  the  available 
data;  the  updating  process  may  be  performed  on  the  basis  of  the  Bayes'  theorem.  Such 
a  situation  would  occur  in  the  case  of  alternative  in  Fig.  2.1;  the  probabilities 
available  for  assessing  the  reliability  of  the  experiment  may  be  expressed  as  P(z  |o  ) 
which  is  the  probability  of  the  experiment  yielding  the  outcome  z^  given  that  the 
actual  outcome  is  8^.  Since  conditional  probabilities  P(0j|z^)  are  required,  the 
Bayes'  theorem  yields 


(2.4) 


P(z,|e  )  p,(0,> 

P(e  |z  )  =  r-£-J - J — 

J  5  p(zje.)  P’(6.) 

where  P' (0^)  is  the  prior  probability  of  0^  which  may  have  to  be  assessed  purely  on  the 

basis  of  judgment,  and  P(0.jz  )  is  the  updated  probability  of  0.  following  the 

1  °  j 

result  of  the  experiment  z^ .  Furthermore,  the  probability  that  the  experiment  will 
yield  z.  is  given  by  the  theorem  of  total  probability  as  follows: 


P(zz)  =  l  P<zje  )  P'(0.)  (2.5) 

j  3  J 

In  the  absence  of  experience  or  other  judgmental  information,  a  diffuse  prior 
probability  may  be  assumed;  this  implies  that  P*  (0  )  is  constant  for  all  j.  Therefore, 
no  specific  bias  toward  any  particular  outcome  is  implicitly  assumed  with  the  diffuse 
prior  distribution.  A  further  application  of  the  diffuse  prior  will  be  discussed  in 
the  section  on  Bayesian  sampling  theory. 

It  might  be  emphasized  that  in  practice,  available  observational  data  is  invariably 
insufficient  for  decision  making  purposes,  whereas  decisions  made  entirely  on  the  basis 
of  subjective  judgment  may  be  difficult  to  justify  and  accept.  In  practice,  therefore, 
some  combination  of  subjective  judgment  combined  with  available  prior  information  and 
observational  data  is  invariably  the  most  viable  approach  to  the  calculation  of  the 
probabilities  for  decision  making  purposes.  In  short,  available  information  should  be 
used  to  its  fullest  extent,  supplemented  with  good  judgment  and  reasonable  assumption 
whenever  necessary. 


To  illustrate  the  updating  process  implied  in  Eq.  2.4,  consider  the  alternative  A., 
of  the  example  discussed  in  Fig.  2.1. 

Suppose  that  the  reliability  of  the  intelligence  information  is  80%;  i.e. 

P(z^[ep  =  0.8  and  P(z2(02)  =  0.8.  Conversely,  this  also  implies  that  P(Zjj02)  = 
p(z2|e1)  -  0.2. 

With  the  prior  probability  estimates  (i.e.  may  be  based  purely  on  judgment)  of 
P,(91)  «  0.3  and  P' (92>  =  0.7,  Eq.2.4  yields 


PC01'*1>  = 


P(z1|e1)  p'^) 

PUje^  p'fOj^)  +p(z1|e2)  p’(e2) 


_ 0.8  x  0.3 _ 

0.8  x  0.3  +  0.2  x  0.7 
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and 


p(e2|z1)  =  1  -  p(e1|z1)  =  0.368 


Similarly, 

P(z  |0  )  P'(0  ) 

P(0  |  )  =  - - L— - — 

p(z  |e  )  P'(e  )  +  p(z  |e  )  P'(e  } 


0.2  x  0.3  _ 

0.2  x  0.3  +  0.8  x’  0.7 


=  0.097 


and,  P(02|z2)  =  1  -  P(91 | z2>  =  0.903 


Also,  in  light  of  the  intelligence  data,  the  probabilities  of  soft  soil  or  hard 
rock  conditions  (i.e.  z^  or  z^)  would  be  calculated  according  to  Eq .  2.5  as 
follows : 


and. 


p(Zl)  =  p(z1|e1)  P'(e1)  +  p(z1|b2)  P'(e2) 

=  0.8  x  0.3  +  0.2  x  0.7 
=  0.38 

P(z2)  =  PCzjle^  p*  (Qj^)  +  P(z2|e2)  P’(02) 

=  0.2  x  0.3  +  0.8  x  0.7 

=  0.62 


21 


III.  UTILITY  AND  UTILITY  FUNCTION 


3.1  Utility  Measure 

Aside  from  the  calculation  of  the  probabilities  in  a  decision  tree,  the  other 
important  requirement  is  the  evaluation  of  the  utility  values  associated  with  the 
various  actions  and  outcomes  in  a  decision  tree.  For  this  purpose,  a  utility  function 
would  generally  be  needed.  Unless  the  consequence  of  a  given  action  can  be  expressed 
purely  in  monetary  terms,  a  utility  function  would  need  to  be  developed;  such  a  utility 
function  must  be  suitable  for  the  specific  problem  under  consideration. 

The  importance  of  a  proper  and  realistic  utility  function  is  central  to  the  appli¬ 
cation  of  statistical  decision  theory.  Indeed,  the  significance  of  decision  theory  to 
practical  problems  will  depend  heavily  on  the  successful  formulation  of  realistic  and 
viable  utility  functions. 

A  proper  utility  function  must  necessarily  be  problem-specific;  it  would  depend 
on  the  gain  parameters  or  loss  parameters  pertinent  to  a  given  problem.  For  this  reason, 
no  single  utility  function  or  type  of  utility  function  can  be  applicable  to  all  problems. 
Nevertheless,  there  are  certain  attributes  that  may  be  useful  to  a  large  class  of  prob¬ 
lems,  which  may  be  identified  as  follows: 

1.  Utility  in  Terms  of  Monetary  Value  —  For  a  class  of  problems,  the  payoff  or 
consequence  from  a  given  action  and  possible  outcome  may  be  expressed  in  monetary  terms. 
All  tangible  and  intangible  attributes  are  assigned  monetary  values.  In  this  regard 
assumptions  and  value  judgment  may  be  necessary  for  evaluating  or  assigning  monetary 
costs  to  all  potential  consequences,  including  the  cost  of  a  fatality  or  the  cost  per 
unit  gain  of  information. 

2.  Consequence  in  Terms  of  Utility  Value  —  Monetary  value  may  not  be  a  suitable 
measure  of  cost  or  payoff.  For  this  reason,  the  concept  of  utility  may  be  appropriate 
as  a  more  general  measure  of  payoff,  and  also  for  combining  several  types  of  attributes. 
In  this  regard,  two  approaches  may  be  considered: 

(i)  Denote  the  consequence  of  each  path  (i.e  each  alternative  and  a  given  outcome) 

by  E^.  Rank  the  results  E^'s  in  terms  of  the  preference  of  the  decision-maker  such 

that  E,  >  E„  >  . . .  >  E  where  >  denotes  "is  preferred  to"  and  n  is  the  number  of  paths 
1  Z  n 

in  the  decision  tree.  Assign  the  utility  values  of  1  and  0,  respectively,  to  E^  and  E^. 
Conduct  an  inquiry  (see  Appendix  A)  perhaps  between  the  decision  analyst  and  the  decision 
maker,  in  order  to  establish  a  consistent  set  of  utility  values  for  all  the  other  paths 
E^,  for  i  ■  2,  3,  ....  n-1. 

(ii)  Alternatively,  if  a  scale  measure  already  exists  with  a  specific  attribute, 
such  as  time  or  fatalities,  or  monetary  value,  a  utility  function  may  be  established 
to  transform  all  the  different  scales  into  a  uniform  utility  scale.  The  reason  for 
this  transformation  is  that  the  original  individual  scales  may  not  be  consistent  with 
the  decision-maker's  preference.  For  example,  a  fatality  of  200  may  be  more  than  twice 
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the  consequence  of  a  fatality  of  100.  Considerations  such  as  these  may  be  introduced 
and  should  be  considered  in  the  formulation  of  the  proper  utility  function.  A  procedure 
for  developing  such  a  utility  function  is  summarized  in  Appendix  B. 

For  a  decision  problem  involving  only  one  attribute,  the  utility  function  described 
above  is  sufficient  for  assigning  or  evaluating  the  utility  value  for  each  path  in  a 
decision  tree.  In  a  typical  case,  however,  several  attributes  may  be  involved.  In  these 
cases,  a  joint  (or  multi-attribute)  utility  function  is  needed  for  evaluating  the 
expected  utility  of  an  alternative  action.  The  determination  or  formulation  of  a  joint 
utility  function  (see  Raiffa,  1969,  Fishburn,  1970),  could  be  cumbersome  especially 
when  the  number  of  attributes  becomes  large.  Assumptions  such  as  the  concept  of  prefer¬ 
ential  independence  and  utility  independence  are  often  invoked  to  simplify  the  assessment 
and  formulation  of  joint  utility  functions  (Keeney,  1972). 

3.  Consequence  in  Terms  of  Cost-Effectiveness  Ratios  —  In  certain  decision  prob¬ 
lems,  the  attributes  may  involve  monetary  costs  and  another  attribute;  the  latter  may 
be  the  number  of  fatalities  saved  (in  the  case  of  an  enemy  attack)  or  the  amount  of 
information  gained  (from  performing  an  experiment) ,  or  the  damage  imposed  on  an  enemy 
installation  (in  the  case  of  targeting) .  In  such  problems,  a  suitable  measure  of  utility 
may  be  expressed  in  terms  of  the  unit  cost  per  fatality  saved,  or  unit  cost  per  unit  of 
information  gained  (e.g.  from  an  experiment). 


3.2  Common  Types  of  Utility  Functions 

Most  utility  functions  are  convex;  this  means  that  the  marginal  increase  in 
utility  decreases  with  increasing  value  of  an  attribute.  The  preference  behavior  of 
a  decision-maker  exhibited  by  a  convex  utility  function  is  commonly  referred  to  as 
"risk  aversiveness".  Most  people  are  risk  aversive  to  a  certain  degree;  some  may  be 
more  risk  aversive  than  others.  The  mathematical  forms  of  utility  functions  commonly 
used  to  model  such  risk  aversive  behavior  would  include  the  following: 

1.  Exponential  Utility  Function: 


U(x)  =  a  +  be~Yx  (3.1) 

where  the  parameter  y  is  the  measure  of  the  degree  of  risk  aversion,  and  a  and  b  are 
normalization  constants.  If  the  utility  function  is  normalized  such  that  U(0)  =  0 
and  U(l)  =  1,  then  the  normalized  exponential  utility  function  becomes 
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the  decision-maker;  i.e.  as  0  increases,  the  decision-maker  has  more  utility  to  spare 
(such  as  money),  and  he  becomes  less  risk  aversive.  A  normalized  logarithmic  utility 
function  may  be  shown  to  be: 


U(x)  =  - ~~ —  [  £n (x+g)  -  in  0)  (3.4) 

3 .  Quadratic  Type 

U(x)  =  a(x  -  l/2a  x^)  +  b  (3.5) 


where  a  is  the  parameter  related  to  the  degree  of  risk  aversion.  A  normalized  quadratic 
utility  function  may  be  shown  to  be: 


U  (x) 


1,12, 

r-o/2  (X  -  2“  x } 


(3.6) 


The  degree  of  risk  aversiveness  of  a  decision-maker  is  measured  by  the  "risk 
aversion  coefficient," 


r  (x) 


u'lOO 

u’(x) 


(3.7) 


where  the  prime  (')  denotes  derivative  with  respect  to  x.  This  coefficient  measures 
the  negative  rate  of  change  in  curvature  of  the  utility  function  with  respect  to  a 
unit  change  in  the  slope  of  the  utility  function.  For  the  exponential  utility 
function,  the  risk  aversiveness  can  be  shown,  using  Eq.  3.2  in  Eq .  3.7,  to  be  a  constant 
i.e. 

r(x)  =  Y.  (3.8) 


Applying  Eq.  3.7  with  Eqs.  3.4  and  3.6,  the  coefficients  of  risk  aversion  for  the 
normalized  logarithmic  and  quadratic  utility  functions  can  be  shown  to  be,  respectively. 


r  (x) 


1 

x+8 


(3.9) 


and. 


r(x) 


a 

1-ax 


(3.10) 


which  are  both  functions  of  x. 

Observe  that  the  coefficient  of  risk  aversion  does  not  vary  with  the  attribute 
in  the  case  of  the  exponential  utility  function;  whereas  in  the  case  of  the  logarithmic 
utility  function  the  coefficient  of  risk  aversion  decreases  with  x  and  in  the  case  of 
the  quadratic  utility  function  the  risk  aversion  increases  with  x. 


r 


The  utility  function,  of  course,  may  also  be  concave  upwards;  that  is,  the 
marginal  increase  in  utility  increases  with  increasing  values  of  the  attribute  x.  In 
such  cases,  the  preference  behavior  of  the  decision-maker  is  referred  to  as  risk 
affinitive.  It  is  believed  that  this  preference  behavior  is  ordinarily  not  realistic. 

3.3  Sensitivity  of  Expected  Utility  to  Form  of  Utility  Function 

Usually,  it  is  difficult  to  ascertain  which  type  of  utility  function  is  most 
appropriate;  e.g.  whether  it  should  be  of  the  exponential  or  quadratic  form.  The 
correct  choice  of  the  form  of  the  utility  function,  however,  may  not  be  very  crucial, 
especially  if  the  expected  utility  values  are  not  sensitive  to  the  form  of  the  function. 

In  order  to  examine  the  sensitivity  of  the  expected  utility  associated  with  a 
given  action,  to  the  above  three  forms  of  utility  functions,  consider  the  simple  case 
in  which  the  possible  outcomes  from  an  action  can  be  described  by  the  value  of  a  random 
variable  X.  In  this  case,  the  expected  utility  of  a  given  action  may  be  expressed  as 
follows . 

' 

E(U)  =  u(x)  fx(x)  dx 

where  fx(x)  is  the  probability  density  function  of  X  and  U(x)  is  the  utility  function. 
Using  the  second-order  approximation  to  evaluate  the  expected  utility  (see  Ang  and 
Tang,  1975),  the  result  is 

E(U)  =  u(x)  +  1/2  Var(x)  +  u" (x)  (3.11) 

where  x  and  var(x)  are  the  mean  and  variance  of  the  random  variable  X,  and  u"(x)  is 
the  second  derivative  of  the  utility  function  evaluated  at  the  mean  value  of  X. 

Applying  Eq.  3.11  to  the  three  types  of  utility  functions  described  earlier,  the 
second-order  approximation  of  the  expected  utility  becomes,  respectively,  as  follows: 

For  the  exponential  utility  function, 

E(U)  ~  — - —  (1  -  e“YX  “  4  2  Var(x) *e  YX  (3.12) 

l-e_Y 

For  the  logarithmic  utility  function, 

E(U)  * - “T  -  Un  (x+6)  -  ing  -  \  Var(x)  •  ~~~ ,]  (3.13) 

en(^)  (X+S)Z 

Finally,  for  the  quadratic  utility  function, 

E(U)  *  --  [x  -  v  a (x2  +  Var  X)]  (3.14) 

1-9-  2 

1  2 
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In  order  to  compare  the  expected  utility  obtained  for  the  three  different  forms 
of  utility  functions,  each  of  the  utility  functions  given  in  Eqs.  3.2  through  3.6  is 
calibrated  to  have  the  same  coefficient  of  risk  aversion  at  the  mean-value  of  the 
random  variable  X.  In  other  words,  substituting  x  for  X  and  y  for  r  in  Eqs.  3.9  and 
3.10,  we  obtain 


A  numerical  study  was  performed  for  the  case  where  x  =  0.5,  and  var(X)  =  0.1  and 
2 

0.25  .  The  numerical  results  are  summarized  in  Table  3.1. 

From  the  results  shown  in  Table  3.1,  two  observations  may  be  deduced,  as  follows: 

1.  The  expected  utility  is  relatively  insensitive  to  the  form  of  the  utility 
function  at  a  given  level  of  risk  aversion;  the  difference  in  the  expected  utility  is 
less  than  2%  among  the  three  types  of  utility  functions  examined  herein. 

2.  The  expected  utility  does  not  change  significantly  (at  most  20Z)  over  the 
range  of  y  (from  0.25  to  1.50)  examined  herein. 

The  implication  of  these  observations  is  that  the  exact  form  of  the  utility  function 
will  not  be  an  important  factor  in  the  computation  of  expected  utility.  Moreover, 
the  risk  aversiveness  coefficient  in  the  utility  function  need  not  be  very  precise; 
i.e.  any  error  in  the  specification  of  the  risk  aversiveness  coefficient  may  not 
cause  significant  difference  in  the  calculated  expected  utility.  In  short,  the 
problem  of  ascertaining  an  accurate  utility  function  would  not  be  crucial  in  the 
application  of  statistical  decision  analysis. 

A  Related  Observation  —  As  indicated  in  Eq.  3.11,  an  approximate  expected  utility 
value  may  be  computed  on  the  basis  of  the  mean  and  variance  of  the  pertinent  random 
variable.  This  would  suggest  that  the  entire  probability  density  function  may  not  be 
necessary  in  most  decision  analysis  problems.  In  practice,  the  first  two  statistical 
moments  could  be  all  the  information  that  may  be  available  for  a  random  variable; 
hence,  Eq.  3.11  provides  a  convenient  approximate  formula  for  computing  the  expected 
utility  of  a  given  action. 
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Table  3.1  Comparison  of  expected  utility  for  different  forms  of  utility  function 


Expected  Utility  Value  (2nd-order  Approximation) 


Variance  (X) 

Y 

Exponential 
Func  t ion 

Logarithmic 

Function 

Quadratic 

Function 

o.i2 

0.23 

0.530 

0.530 

0.530 

II 

0.50 

0.563 

0.561 

0.560 

II 

1.00 

0.618 

0.626 

0.620 

tl 

1.50 

0.672 

0.  707 

0.680 

2 

0.25 

0.25 

0.523 

0.524 

0.523 

H 

0.50 

0.547 

0.548 

0.547 

ii 

1.00 

0.593 

0.603 

0.594 

it 

1.50 

0.636 

0.676 

0.641 

3 . 4  Sens  U: ivit^  of  Dec i s i o n 

The  question  of  whether  or  not  the  optimal  action  suggested  by  a  formal  decision 
analysis  is  sensitive  to  changes,  or  possible  changes,  in  the  probabilities  and/or 
utilities  assigned  to  the  various  brandies  and  paths  of  a  decision  tree  is  clearly  of 
interest.  In  view  of  the  fact  that  the  assignment  of  probabilities  are  often  based  on 
subjective  judgments,  the  sensitivity  of  the  optimal  alternative  to  variations  in  the 
input  information  clearly  deserves  some  attention. 

If  the  optimal  alternative  suggested  by  a  formal  decision  analysis  has  an  expected 
utility  value  that  is  far  greater  than  those  associated  with  the  other  alternatives, 
it  may  be  obvious  that  even  significant  changes  in  the  input  variables  would  not  alter 
the  final  results.  However,  in  situations  where  changes  in  the  input  variables  could 
conceivably  alter  the  optimal  alternative,  the  decision  analysis  could  proceed  assuming 
that  the  probabilities  and  utilities  are  variables  instead  of  assigned  numbers.  An 
example  is  presented  below  demonstrating  such  a  sensitivity  analysis  for  a  simple 
decision  tree,  in  which  the  pertinent  probability  factors  are  treated  as  variables.  It 
may  be  observed  from  this  example  that  the  unknown  probability  (or  utility)  does  not 

need  to  be  known  exactly,  so  long  as  it  can  be  estimated  to  be  within  a  range  in  which 

a  specific  alternative  can  be  shown  to  be  superior. 

Example 

Consider  the  hypothetical  decision  problem  of  Fig.  2.1.  Suppose  that  the  gathering 
of  additional  intelligence  data  is  not  a  viable  option.  Thus,  the  only  feasible 
alternatives  are  and  A£.  The  decision  maker  believes  that  there  is  a  high  probability 
that  the  site  will  be  hard  rock;  however,  the  exact  value  of  p  is  not  known.  From  the 

decision  tree  of  Fig.  3.1  the  expected  utility  of  alternatives  A^  and  A^  are 


27 


E(U  )  =  50  (1-p)  +  80  p  -  50  +  30p 
A1 

E(U  )  =  100  (1-p) 

2 

Figure  3.2  shows  a  plot  of  these  two  expected  utilities  as  a  function  of  p.  It  can 

be  observed  that  for  p  0.385,  E(U  )  >  E(U  ),  hence  using  several  small  weapons 

A2  i 

(A^)  is  a  better  alternative;  whereas  for  p  >  0.385,  using  a  single  large  weapon 

(i.e.  A^)  is  the  better  alternative.  Therefore,  even  though  the  exact  probability 

of  hard  rock  at  the  enemy’s  site  is  not  known,  a  knowledge  that  p  >  0.385  is 
sufficient  for  deciding  on  a  single  large  weapon. 

If  several  probabilities  and  utilities  have  large  uncertainties,  a  conservative 
approach  may  be  taken  to  investigate  whether  the  optimal  alternative  remains  valid; 
that  is,  the  extreme  probabilities  and  utilities  may  be  used  in  the  decision  analysis 
to  obtain  a  conservative  choice  of  action.  This  may  suggest  an  action  that  may  or  may 
not  be  the  same  as  the  action  obtained  on  the  basis  of  the  expected  utility  criterion. 
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Figure  3.1  Decision  tree  for  example  of  sensitivity  analysis 


Figure  3.2  Expected  utility  as  function  of  probability  p 


IV.  DECISION  CONCEPTS  IN  SAMPLING  AND  ESTIMATION 


Statistical  sampling  may  be  viewed  as  a  decision  problem  —  first  of  all,  there 
is  the  binary  decision  of  whether  or  not  sampling  should  be  undertaken  at  all,  and 
if  so  how  extensive  should  the  sampling  program  be  (i.e.  what  should  the  sample  size 
be?).  In  this  regard,  certain  concepts  of  Bayesian  statistics  are  pertinent. 

4 . 1  Bayesian  Sampling 

The  basic  assumption  in  the  Bayesian  approach  to  statistical  sampling  and  estima¬ 
tion  is  that  the  parameters  of  a  probability  distribution  of  a  random  variable  are 
themselves  random  variables.  The  uncertainty  associated  with  a  given  parameter  may 
also  be  described  with  a  probability  density  function. 

A  prior  probability  density  function  f'(0)  for  a  parameter  6  may  be  prescribed 
on  the  basis  of  judgment,  or  established  on  the  basis  of  prior  data  and  information. 
Observational  data  may  be  used  to  revise  or  update  the  prior  distribution  for  the 
parameter.  Based  on  the  Bayes  theorem,  the  updated  or  (posterior)  probability  density 
function  for  the  parameter  0  is  given  by  the  following  (see  Ang  and  Tang,  1975)  : 

f"(0)  =  k  1(0)  f'  (6)  (4.1) 


where : 


0  =  the  parameter  under  consideration; 

f '  (0)  =  the  prior  probability  density  function  of  0,  i.e. 
prior  to  the  observational  data; 

L(0)  =  the  likelihood  function  of  0,  given  as  the  product  of 

the  density  function  of  the  original  variable  X  evaluated 

at  the  observed  data  points  x, ,  x„ . x  ;  i.e. 

1  l  n 

KO)  =  "  fx(xt|e); 

i  =  l 


f"(9)  =  the  posterior  distribution  of  0,  i.e.  updated  in  light 
of  the  additional  observational  data;  and 

k  =  a  normalization  constant  to  insure  that  f"(0)  is  a 
proper  probability  density  function. 

As  may  be  observed  in  Eq.  4.1,  the  Bayesian  approach  allows  any  prior  information  on 
the  parameter  6  to  be  incorporated  systematically  in  the  final  determination  of  0. 


The  prior  information  could  be  based  on  previous  sampling  results,  or  indirect 
measurements,  or  simply  based  on  subjective  judgments.  In  the  event  that  there  is  no 
objective  basis  for  establishing  a  prior  distribution,  the  diffuse  prior  (i.e.  the 
uniform  distribution  between  0  and  1)  may  be  used.  In  such  a  case,  f'(8)  is  simply  a 
constant,  i.e.  not  a  function  of  0,  and  the  posterior  distribution  f"(9)  becomes 

n 

f " (0 )  =  k  L(8)  =  k  it 
i=l 

which  consists  of  sampling  information  only, 
distribution". 

Sampling  from  Gaussian  Population  —  As 
from  a  normal  population  X.  Assume  that  the 
of  the  population  p  is  the  only  parameter  to 
In  this  case,  the  likelihood  function  can  be 

L(p)  =  Nu(x,  2-) 

where  N^Cx.o/i^n)  denotes  the  normal  probability  density  function  for  p  with  mean  value 
x  and  standard  deviation  otJ~\ n.  x  is  the  sample  mean  of  the  n  data  values.  In 
other  words. 


fx(xj0)  (A. 2) 

and  may  be  referred  to  as  the  "data-based 

an  illustration,  consider  the  sampling 
2 

variance  c  is  known  and  the  mean-value 
be  estimated  from  a  sample  of  size  n. 
shown  to  be  (see  Ang  and  Tang,  1975) 


N  (x,o//n)  - - - -  exp  [-  \  (^7 )2]  (A.  3) 

“  2uo/ /n  2 

Hence,  the  data-based  posterior  distribution  of  p  is  simply  a  normal  distribution 
N^(x,o/i/n  ) 

It  can  be  shown  (Ang  and  Tang,  1975)  that  the  Bayesian  distribution  of  the  basic 
random  variable  X,  incorporating  the  effect  of  uncertainty  in  the  parameter  p  remains 

—  r 2  2 

normal  with  mean  x  and  standard  deviation  /a  +  o  /n.  The  sampling  uncertainty,  denoted 

2  2 
by  a  /n,  is  added  to  the  basic  or  inherent  variability  a  to  yield  the  overall 

uncertainty. 

On  the  other  hand,  if  a  prior  distribution  of  p  is  available,  e.g.  if  f'(p) 
is  normal  N  (p{  o'),  where  p'  and  o'  are  respectively  the  prior  mean  and  standard 
deviation  of  p,  then  the  posterior  distribution  of  p  can  be  shown  to  be  also  normal 
N^(p",  a")  in  which  (see  Ang  and  Tang,  1975), 


M 


•  I 


x(q*)  +  p  (q  /n) 

i  t  2 
(o  )  +  o  /n 


(A. A) 
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The  results  in  Eqs.  4.4  and  4.5  can  be  generalized  such  that  if  there  are  two 
independent  sources  of  information  on  the  parameter  p,  for  example  as  represented  by 
Np^l’qp  ancl  Np^*J2’q2^’  the  tW0  sources  can  combined  to  yield  an  overall  mean-value 
of  the  parameter  p  as  follows: 


P 


M 


+ 


+ 


(4.6) 


and  the  corresponding  standard  deviation  of  the  parameter  y  becomes. 


a 


M 


qlq2 

/  a^2  +  a  2^ 


(4.7) 


The  Bayesian  approach  described  and  illustrated  above  may  be  applied  also  to 
sampling  from  populations  that  are  not  normal;  for  such  cases,  see  Ang  and  Tang  (1975). 

The  relation  between  the  likelihood  function  and  the  prior  and  posterior  distri¬ 
butions  of  the  parameter  6  is  illustrated  in  Fig.  4.1.  Observe  that  the  posterior 
distribution  is  sharper  than  either  the  prior  distribution  or  the  likelihood  function. 
This  implies  that  more  information  is  contained  in  the  posterior  distribution  than  in 
either  the  prior  or  the  likelihood  function;  this  is,  of  course,  to  be  expected  as  the 
posterior  distribution  integrates  the  information  from  the  prior  with  that  in  the 
likelihood  function. 


I 


Figure  4.1  Prior,  likelihood  and  posterior  functions 
for  parameter  p 
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4 . 2  Bayes'  Point  Estimator 

Based  on  the  results  of  sampling,  and  any  prior  Information  that  may  be  available, 
the  previous  section  shows  that  an  improved  posterior  probability  distribution  for  a 
parameter  can  be  obtained  or  established.  For  many  practical  purposes,  however,  the 
point  estimator  (instead  of  the  distribution)  of  the  parameter  is  required  or  is  more 
useful.  Such  a  point  estimate  may  be  obtained  from  a  decision  analysis. 

Consider  the  decision  tree  in  Fig.  4.2  which  depicts  a  situation  in  which  the 
point  estimator  of  the  parameter  6  is  to  be  selected.  Because  of  uncertainty,  an 
estimate  of  the  real  value  of  the  parameter  6  may  contain  error;  therefore,  for  a 
particular  choice  of  the  estimator  0,  a  prediction  error  will  result  with  some  associated 
loss.  Modeling  this  estimation  process  as  a  decision  process,  the  Bayes  point 
estimator  may  be  determined  such  that  the  expected  loss  associated  with  the  error  in 
prediction  is  minimized.  Mathematically,  if  0  is  the  estimator  of  a  parameter  0  whose 
actual  value  is  described  by  a  distribution  f(0),  the  expected  loss  due  to  error  in 
prediction  is 


g(9, 0)  f(9)  d0 


(4.8) 


where  g(0,0)  is  the  "loss  function"  for  given  values  of  6  and  0.  The  Bayes'  estimator 
is  based  on  the  point  estimate  that  minimizes  the  expected  loss;  thus,  it  may  be 
obtained  from  the  following  relationship: 


3L  _  —  f (0)  de  =  0  (^.9) 
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Therefore,  depending  on  the  form  of  the  loss  function,  g(0,0),  the  optimal  choice 
for  the  estimator  may  be  different.  For  example,  if  the  loss  function  is  quadratic, 
that  is 


g(8,0)  '  (6-  8)2 

the  Bayes  estimator  can  be  shown  to  be  the  mean-value  of  6;  whereas  if  the  loss  function 
is  linear,  that  is 


g(e,e)  -  (e  -  e) 


then  the  Bayes  estimator  is  the  median  value  of  9. 
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le  Size 


In  addition  to  the  selection  of  an  estimator  on  the  basis  of  available  sample 
data,  other  questions  that  may  be  addressed  pertaining  to  statistical  sampling  would 
include  the  following. 

1.  Should  a  sampling  program  be  implemented? 

2.  If  sampling  is  necessary,  how  much  sampling  should  be  done;  i.e.  what 
should  be  the  sample  size? 

Ihe  problem  of  determining  the  optimal  sample  size  may  also  be  modeled  as  a  preposterior 
analysis.  Suppose  X  is  a  random  variable  with  parameter  6  that  needs  to  be  estimated 
with  the  sample  data.  A  decision  tree  that  includes  the  determination  of  the  optimal 
sample  size  is  shown  in  Fig.  4.3.  Three  phases  of  the  decision  process  may  be 
identified.  The  first  phase  of  the  process  pertains  to  wehther  or  not  sampling  should 
be  conducted  (at  node  A).  If  sampling  is  the  preferred  action,  then  a  determination 
of  the  sample  size  (at  node  B)  would  be  needed.  After  the  sample  data  have  been 
collected,  an  estimator  0  (at  node  C)  is  selected.  The  total  loss  for  each  of  the 
paths  involving  sampling  will  depend  on  the  sample  size  n,  and  the  estimated  value  of 
the  parameter  relative  to  its  true  value. 

The  pertinent  decision  analysis  would  start  at  the  last  node  of  the  decision 
tree.  The  expected  loss  for  a  given  n,  (x),  and  estimate  0,  is  given  by  weighing 
the  loss  over  the  posterior  distribution  of  0  as  follows: 

E(Ljn,{x},8)  *  (  L(n,{x},9,6)  f"<0)  d0  (4.10) 


At  node  C,  the  optimal  estimate  0  is  the  value  which  minimizes  the  expected  loss  of 
Eq.  4.10;  that  is,  from 


E(L[n,(x} ,9)  =  0 


(4.11) 


Eq.  4.11  should  yield  the  optimal  estimator,  0  ,  on  the  basis  of  which  the  expected 

opt 

loss  for  a  given  sample  size  n  (at  node  D)  becomes 


E(L|n)  =  E(L|n,{x),0  )  f{x)  d{x) 

j  opt 

The  required  optimal  sample  size,  nQpt>  roay  then  be  evaluated  from 

dE(L|n)  _  Q 
dn 


(4.12) 


(4.13) 
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Figure  4.3  Decision  tree  for  optimal  sample  size 
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Example 

Suppose  that  an  exploration  program  is  planned  for  determining  the  shear  velocity 
of  rocks  at  different  sites  for  the  construction  of  a  military  system.  The  mean  shear 
velocity  of  the  rock  stratum  is  of  principal  interest  for  engineering  purposes;  from 
this  information,  the  elastic  modulus  and  constraint  modulus  of  the  rock  may  be  esti¬ 
mated.  The  shear  velocity  of  the  rock  may  be  assumed  to  follow  a  normal  distribution. 

Assume  that  the  loss  due  to  any  error  in  the  estimated  mean  velocity  is  proportional 
to  the  square  of  the  error,  and  the  cost  per  unit  of  squared  error  is  related  to  the 
cost  of  performing  field  measurements.  For  simplicity,  assume  also  that  the  c.o.v.  of 
the  shear  velocity  of  the  rock  (representing  the  inherent  variability  of  the  rock 
stratum  plus  any  error  in  measurement)  is  about  15%. 

The  determination  of  the  optimal  number  of  measurements  is  of  interest.  Two 
cases  may  be  considered;  namely, 

(1)  no  prior  information  on  the  mean  shear  velocity  is  available; 

(2)  prior  information  on  the  mean  velocity  of  similar  rocks  is  available; 
hypothetically  suppose  that  this  is  N(4,000  fps,  400  fps). 

The  loss  function  may  be  expressed  as 

“  '  2 

L(n,{x),  p,p)  =  c(u  -  p)  +  kn 


where,  p  denotes  the  mean  shear  velocity  of  the  rock,  which  is  the  parameter  to  be 
estimated;  p  is  the  estimator  for  p;  c  is  the  cost  per  unit  of  squared  error  in  the 
estimator;  and  k  is  the  cost  of  one  measurement.  This  loss  function  assumes  that  the 
loss  due  to  error  in  p  is  quadratic,  and  the  cost  of  sampling  is  linear  with  sample 
size  n.  Applying  Eq.  4.10, 


E(L| n, { x} , p) 


[c(p-p)2  +  kn]  f"(p)  dp 


=  cE"(p-p)2  +  kn 
V 


=  cE"  [(p-p")  +  (p”-p)]2  +  kn 
P 


=  cVar(p)  +  c(p"-p)2  +  kn 


where  p"  and  Var”(p)  are  the  posterior  mean  and  variance  of  p,  respectively. 
Applying  Eq.  4.11, 


dEOLn^iO  _  2  c(u„_;)  .  0 
d  u 
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r 


from  which  the  optimal  estimator  is. 


(p)  .  =  •*" 

opt 

which  means  that  the  optimal  estimator  is  the  mean-value  of  the  posterior  distribution 
of  p . 

The  corresponding  expected  loss  then  is 


E[ L,|  n ,  { x)  ,p  )  =  c  Var" (p)  +  (p"-p  )2]  +  kn 

opt  opt 


=  c  Var" (p)  +  kn 


The  shear  velocity  of  the  rock  is  Gaussian  with  known  standard  deviation  a;  whereas, 
the  posterior  variance  of  u  is  given  by  Eq.  4.5.  Hence,  the  maximum  expected  loss  is. 


E(L | n, { x} , p  )  =  c 

opt 


(ol)2io2/n) 
2  2 
(o')  +a 


where  p'  and  o'  are  the  prior  mean  and  standard  deviation  of  the  mean  shear  velocity  p. 

From  Eq.  4.12,  the  expected  loss  for  a  given  number  of  sample  measurements  n, 
is  thus 


E(L[ n) 


[c  ^ 


(o')' 


(q2/n) 

2 

+  o  /n 


+  kn]  f({x})  d{x) 


-  IsJJL  (o2/n)  . 

=  c  — - „ - r--  +  kn 

(a’)Z  +  (oZ/n) 


For  the  case  in  which  no  prior  information  on  p  is  available,  the  estimate  must  be 
based  purely  on  the  current  field  measurements.  In  such  a  case,  the  variance  of  p 
is  simply 

Var"(p)  =  02/n 

Hence, 


i 


E(L|n)  =  c 


2, 

a  /n 


+  kn 
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The  optimal  number  of  measurements  is  given  by  the  solution  to  the  following 
equation. 

=  _  c  (°f  )  +  k  =  0 

an  i 

n 

from  which  the  optimal  sample  size  is. 


Hence,  eleven  measurements  is  the  optimal  sample  size  for  the  site  exploration  program. 

In  the  case  where  prior  information  on  the  mean  shear  velocity  is  available,  the 
optimal  sample  size  would  be  as  follows: 


Hence,  in  this  case  the  optimal  number  of  measurements  is  9,  indicating  that  the  prior 
information  is  worth  approximately  two  measurements. 
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V.  EVALUATION  OF  FIELD  AND  LABORATORY  DATA 

5 . 1  Introductory  Remarks 

The  concepts  and  tools  presented  in  Chapters  II  through  IV  may  be  used  in  a  number 
of  situations  of  strategic  significance.  A  few  of  these  are  illustrated  in  the  following. 
The  potential  areas  of  application,  of  course,  are  not  limited  to  those  described  here¬ 
in;  indeed,  the  possible  areas  of  applications  could  be  quite  broad.  In  the  final 
analysis,  the  applications  areas  are  limited  only  by  the  imagination  and  creativity  of 
the  user  in  applying  the  basic  concepts  described  herein.  As  alluded  to  earlier,  the 
application  of  the  general  concepts  is  often  not  straight-forward,  as  most  significant 
applications  require  modeling  and  conceptual  formulations  of  the  underlying  physical 
problem.  In  this  light,  the  examples  of  applications  described  in  the  sequel  are  merely 
to  demonstrate  the  general  significance  of  statistical  decision  concepts,  and  to 
illustrate  the  applications  to  specific  situations  involving  strategic  planning  and 
decision  making. 

5 . 2  Field  versus  Laboratory  Tests 

One  of  the  prime  objectives  of  a  test  program,  either  field  or  laboratory,  is 
often  to  validate  or  revise/update  some  prior  theoretical  or  empirical  relationship 
for  predicting  conditions  in  the  real  world,  or  for  extrapolating  prior  observations 
beyond  the  range  of  available  data.  In  statistical  terms,  a  prediction  is  usually 
given  in  terms  of  the  mean  value  and  associated  standard  deviation  (or  coefficient 
of  variation).  Invariably,  such  predictions  are  imperfect  and  thus  will  contain 
inaccuracy  and  uncertainties.  For  example,  the  predicted  or  estimated  mean-value  x 
of  a  variable  X  may  contain  bias,  which  is  a  systematic  error,  such  that  the  correct 
mean-value  would  be 


where  v  is  the  mean  bias  factor  necessary  (even  if  only  to  be  determined  subjectively) 

to  obtain  the  correct  mean-value  u  . 

x 

In  addition,  there  may  also  be  statistical  error  in  the  estimated  mean-value  x. 
That  is,  suppose  that  x  is  the  estimated  sample  mean  from  a  sample  of  size  n.  Then, 
conceptually,  if  additional  samples  of  the  same  size  were  to  be  obtained,  there  could 
conceivably  be  some  scatter  in  x;  this  scatter  is  given  by  o  /n,  which  decreases  with 
the  sample  size  n  and  represents  only  the  random  error  due  to  sampling.  However,  there 
may  be  other  factors  that  could  contribute  additional  random  errors  to  x,  which  may 
be  represented  by  the  coefficient  of  variation  A^.  The  c.o.v.  A^  may  be  much  larger 
than  the  sampling  error  if  predictions  are  based  entirely  on  laboratory  and  theoretical 
results;  whereas  it  may  be  negligible  If  actual  field  data  were  used. 
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In  any  case,  test  results  are  particularly  useful  for  evaluating  or  improving 
the  mean  bias  factor  v  of  a  prediction,  as  well  as  for  evaluating  the  c.o.v.  . 
Either  field  data  or  laboratory  test  data,  or  combinations  thereof,  are  useful  for 
these  purposes. 

Field  Tests  —  If  field  test  results  are  available,  they  can  be  used  to  update 
any  prior  information  as  follows: 

Let  B  ■  the  bias  in  the  prediction  of  the  real  world.  Then,  assuming  that  the  result 
of  a  field  test  is  a  realization  of  the  real  world,  the  bias  is 


b  -  yx, 


where : 


Xp  =  field  observation,  which  is  a  realization  of  the 
real  world,  and; 

Xp  =  a  prediction  of  the  real  world 


If  the  sample  size  of  the  field  data  is  n  ,  the  mean  of  the  individual  ratio 

r 


(Xpi/XpJ  should  yield  the  mean  bias 

4  |  (W 


B  = 


(5.2) 


Also,  from  the  same  set  of  field  data,  the  error  in  B  is. 


“B  ’  /r 


(5.3) 


where  ftg  =  Og/B^,  and  Cg  is  the  sample  standard  deviation  of  the  ratios  x^/x^. 

In  other  words,  B  is  the  inaccuracy  of  the  prediction.  From  n  sample  field 

r 

data,  the  mean  bias  may  be  estimated  as  in  Eq.  5.2.  Also,  from  the  same  set  of  data, 
the  c.o.v.  of  the  mean  bias  B  can  be  estimated  as  in  Eq.  5.3. 

Laboratory  Tests  —  In  the  case  of  laboratory  test  results,  the  total  inaccuracy 
in  the  prediction  may  be  divided  into  two  parts.  First,  the  prediction  may  be  bias 
relative  to  the  laboratory  tests;  in  addition,  there  may  be  systematic  difference  between 
laboratory  and  field  test  results.  Therefore,  the  total  inaccuracy  of  a  prediction  may 
be  represented  as. 


XF  \ 

B  -  (/)  (t^)  -  C  x  A 
*L  *P 


(5.4) 


The  ratio  C  “  X^/X^  sa!'  called  the  "calibration  factor"  representing  the  error  or 
bias  of  laboratory  tests,  whereas  A  ■  X^/Xp  is  the  ratio  of  laboratory  data  to  the 
prediction. 


From  a  set  of  n  laboratory  test  data,  the  mean  bias  between  a  prediction  and 
i-< 

laboratory  tests  may  be  estimated  as 


1  i  ^  (xld/xPi) 


(5.5) 


The  mean  bias  of  laboratory  test  results,  i.e.  C,  may  have  to  be  assessed  judgmentally , 
unless  there  are  prior  field  and  laboratory  data  to  permit  its  estimation.  The  total 
mean  bias,  therefore,  is 


B  =  C  x  A 


(5.6) 


From  the  same  set  of  laboratory  data,  the  c.o.v.  of  A  can  be  estimated  as 

0—  =  i  /o^/n,  (5.7) 

A  A  A  L 


where  a  =  the  sample  variance  of  the  ratios  (x^/x^).  Thus,  the  total  c.o.v.  of 


B  becomes 


nB  =  /nC  +  nA 


(5.8) 


where  fl--  is  the  c.o.v.  of  C. 

5 . 3  Posterior  (Updated)  Information 

Assuming  that  the  prior  estimates  of  the  prediction  error  are  respectively  B' 

and  n'— ,  the  test  results  (either  field  or  laboratory)  may  be  used  to  update  or 
B 

improve  these  prior  estimates,  obtaining  the  posterior  estimates  B"  and  fi"—  as 
follows : 


B'  ($l_  B)2  +  B («’ -  B’)2 

g!I  _ _ JL _ _ _ 5 _ _ 

(n-  B) 2  +  (SI'-  B’)2 


(5.9) 


and. 


a"—  = 

“  B  B’ 


L  (fig  B)(J2'-b-  B’) 


(5.10) 


'(a-j-B)2  +  (n'-B-  b')2 


where  B  and  SI—  are  as  given  by  Eqs.  5.2  and  5.3,  or  Eqs.  5.6  and  5.8. 

B 

In  the  case  where  the  test  results  consist  of  both  field  and  laboratory  tests, 
the  two  types  of  tests  may  be  combined  to  give 


V\  v2  +  v2 

(n«  b .)2  +  (n5  b  ) 2 

Bh  L  BF  F 


(5.11) 
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(5.12) 


(Of  B  )  (ftr  B  ) 

1  bf  F  bl  L 

o  "  FT 


/ 


B„) 2  +  (Sir-  Bx)2 
Bf  F  BL  L 


where  B„  and  Sl5  are  given  by  Eqs.  5.2  and  5.3;  whereas  B,  and  Sir  are  given  by  Eqs. 

F  B  L  Bl 

5.6  and  5.8.  These  may  then  be  used  to  update  any  prior  estimates  B'  and  SI'—; 

O 

obtaining  the  corresponding  posterior  estimates  as  shown  in  Eqs.  5.9  and  5.10. 

5.4  Measure  of  Information 

In  order  to  define  or  establish  a  utility  measure  for  a  test  plan,  some  measure 
of  information  gained  from  the  experiment  is  obviously  important,  in  addition  to  the 
cost  of  the  experiment.  For  this  purpose,  observe  the  following. 

Perhaps  of  first  order  importance  from  a  test  plan  is  the  evaluation  of  the  mean 
bias  factor  of  a  prediction;  in  this  regard,  the  utility  function  may  be  defined  in 
a  quadratic  form  as  follows  (or  other  forms  may  be  prescribed) : 

U|y-  ,  B  =  -k(B  -  y-)2  (5.13) 

where, 

u^-  **  the  true  mean  bias; 

B  *  estimated  mean  bias  based  on  the  experimental  data. 

Assuming  that  y—  is  a  random  variable,  the  expected  value  of  Eq.  5.13  with  respect 

D 

to  y-  is, 

E(u|b)«  -k  E  (B  -  y-)2  -  k  Var(B)  -  -  k(fc-  B)2  (5.14) 

llB  B  B 

Finally,  taking  the  expected  value  over  all  values  of  B, 

E(U)  -  k  E-b-  (figB)2  -  -  kS22-  [Var(B)  +  E2(B)]  (5.15) 

—  2  — 

Observe  that  Var(B)is  a  second  order  term,  i.e.  <<1.0;  whereas  E  (B)  -  1.  Hence 

E(U)  -  -  k  n|  .  (5.16) 

Similarly,  the  utility  function  may  be  defined  in  the  form, 

0|uf,  B  -  -k1(B  -  y-)2  +  k2(y-  -  1)  (5.17) 
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or 

u|ug,  B  =  -k^B  -  u-)2  +  k2(B  -  1)  (5.18) 

In  either  of  these  latter  cases,  it  can  also  be  shown  that  the  expected  utility  is 

ECU)  =  -k^2  (5.19) 

In  short,  the  above  results  show  that  if  a  quadratic  utility  function  is  appropriate, 

the  measure  of  information  content  can  be  defined  solely  in  terms  of  the  c.o.v.,  Si  — 

Therefore,  the  information  gained  from  a  test  program  would  be  inversely  pro- 
2 

portional  to  SJg-.  The  cost  of  a  test  program  obviously  is  directly  proportional  to  the 

size  of  the  program,  i.e.  sample  size  n.  Consequently,  the  utility  of  a  test  program 

2 

may  be  defined  as  a  function  of  the  cost  per  unit  of  information  gained,  or  cSlz. 

B 

Therefore,  the  "best"  experiment  may  be  the  one  that  minimizes  the  cost  per  unit 
information  gain. 

Example 

For  illustration,  consider  the  determination  of  the  bearing  capacity  of  a  large 
site  for  the  design  of  the  foundation  of  a  military  facility.  For  this  purpose,  assume 
that  the  test  options  include: 

(1)  load  bearing  capacity  tests  in  the  field; 

(2)  unconfined  compression  tests  of  laboratory  soil  samples  obtained  from  boring, 
and  subsequently  calculating  the  bearing  capacity  based  on  the  soil  parameters 
obtained  from  such  tests. 

Obviously,  load-bearing  capacity  tests  in  the  field  are  expensive  and,  therefore, 
only  a  limited  number  of  such  tests  may  be  performed  (if  at  all).  However,  more 
extensive  soil  samples  may  be  obtained  over  the  entire  site. 

Suppose  that  five  load  tests  were  performed;  the  locations  of  the  tests  were 
spaced  sufficiently  far  apart  to  cover  the  entire  area  of  the  site.  At  each  of  the 
load  test  locations,  assume  that  one  soil  sample  is  also  taken  which  may  be  considered 
to  be  the  "control  samples".  The  measured  bearing  capacity  from  the  load  tests,  and 
the  corresponding  bearing  capacity  estimated  from  the  control  soil  samples,  may  be  as 


follows: 

Bearing 

Control  Soil 

Bearing 

Ratio 

Load  Tests 

Capacity  xp 

Sample 

Capacity  xL 

XF/XL 

1 

3,500  psf 

1 

4,000  psf 

0.875 

2 

3,000 

2 

3,000 

1.00 

3 

2,500 

3 

3,200 

0.781 

4 

4,000 

4 

3,500 

1.143 

5 

4,300 

5 

4,500 

0.956 

For  the 

remainder  of  the  site, 

only  soil  samples  are 

taken;  say  fifteen 

more  were 

obtained  in  addition  to  the  control  samples,  giving  results  as  follows: 


45 


t 


Soil  Sample  Bearing  Capacity, 


4,300  psf 

7  -  3 ,600 

8  -  3,800 

9  -  4>700 

10  -  4,200 

11  -  3 ,900 

12  -  3,300 

13  -  2,900 

14  -  4,800 

15  -  4,000 

16  -  3,700 

17  -  3 , 600 

18  -  4,100 

19  -  3,900 

20  -  4 ,500 


Assume  that  the  soil  type  over  the  entire  area  of  the  site  is  fairly  uniform,  to  permit 
the  use  of  a  uniform  bearing  capacity  for  the  entire  site  (if  the  soil  type  is  not 
uniform,  the  area  may  be  divided  into  subareas).  The  results  of  the  field  and  laboratory 
tests,  describe  hypothetically  above,  then  can  be  used  as  follows. 

From  the  field  test  results,  the  mean  bearing  capacity  is 

Xp  =  3460  psf; 

and  the  corresponding  c.o.v.  is 

S2V  =0.21 

XF 

Also,  the  uncertainty  (c.o.v.)  in  the  estimated  mean  bearing  capacitv  is 

fte-  =  =  0.09 

*p  JT 

On  the  basis  of  the  laboratory  soil  samples,  the  mean  bearing  capacity  is  calculated 

to  be 


=  3875  psf; 

and  the  corresponding  c.o.v.  is, 


from  which  the  c.o.v.  of  3?  is 

L 


0.14 


0.14 

/20 


0.03. 


Comparing  the  bearing  capacity  estimated  using  the  control  soil  samples  with  the 
corresponding  field  test  results,  it  is  obvious  that  x^  tends  to  overestimate  the  actual 
value  of  the  bearing  capacity  at  the  site.  In  otherwords,  there  is  a  systematic  bias 
in  the  laboratory-based  bearing  capacity  estimates;  i.e.  the  correct  bearing  capacity 
may  be  expressed  as 

X  =  CXL 

where  C  is  the  bias  in  the  laboratory-based  bearing  capacity  estimate  for  the  site. 

From  the  ratios  of  X„/X  given  in  the  above  table,  the  mean  value  of  C  is 


C  =  0.951 

and  the  associated  c.o.v.  is 


=  0.143 


C  0.951 
whereas  the  c.o.v.  of  C  is, 

0.143 


A  =  m  0 . 064  . 

C  /5 


Thus,  the  total  uncertainty  underlying  the  laboratory-based  mean  bearing  capacity  is 

■  R  f 


064 2  +  0.032  =  0.071 


The  load  test  data  as  well  as  the  unconfined  compression  laboratory  test  data 
described  above  are  useful  for  estimating  the  bearing  capacity  for  the  site  in  question. 
That  is,  both  sets  of  data  can  be  combined  in  accordance  with  Eqs .  5.11  and  5.12,  to 
give  a  composite  estimate  of  the  bearing  capacity  for  the  site  as  follows: 

-  =  3460(0. 071x0. 951x3875)2  +  0. 951x3875(0. 09x3460)2 
(0. 071x0. 951x3875)2  +  (0.09x3460)2 


-  3592  psf • 


“x  “  3592 


(0.09x3460) (0.071x0. 951x3875) 

/(. 09x3460)2  +  (0. 071x0. 951x3875)2 


-  0.06. 
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VI.  ANALYSIS  AND  PLANNING  OF  TEST  PROGRAMS 


6.1  Introduction 

In  test  programs  that  are  extremely  costly,  such  as  the  Mighty  Epic  and  the  Diablo 
Hawk  test  programs,  the  need  for  a  systematic  framework  for  planning  the  experiments  is 
clear.  The  purpose  and  objective  of  any  test  program,  of  course,  is  to  obtain  informa¬ 
tion  to  improve  the  existing  state  of  information  and  knowledge. 

A  test  plan,  therefore,  should  be  to  obtain  the  maximum  benefit,  which  may  be 
defined  in  terms  of  information  gained  or  improvements  accruing  from  the  tests. 

The  basic  concepts  of  statistical  decision  may  be  used  to  advantage  in  the  planning 
of  test  programs;  however,  the  application  of  these  concepts  require  modeling  of  the 
specific  problem  underlying  a  particular  test  program.  In  view  of  this,  the  models 
developed  herein  will  refer  to  test  programs  similar  to  the  Mighty  Epic  or  the  Diablo 
Hawk  programs. 

In  particular,  referring  to  test  programs  such  as  the  Diablo  Hawk,  the  programs 
involve  the  testing  of  specified  diameter  tunnels  subjected  to  nuclear  weapons 
effects.  Decisions,  therefore,  are  required  relative  to  the  choice  of  diameters  of 
the  tunnels  to  be  tested,  the  location  of  the  tunnels  relative  to  the  blast  point 
which  may  be  given  in  terms  of  the  maximum  strain  at  which  the  tunnels  may  be  subjected, 
as  well  as  the  number  of  tunnels  (or  sections  of  tunnels)  of  a  given  diameter. 

An  approach  to  this  problem  is  developed  and  formulated  below.  Before  describing 
the  approach,  consider  the  following. 

6.2  Preliminary  Consideration 

Presumably,  the  failure  strain  of  a  tunnel  with  given  diameter  is  random  as 

shown  in  Fig.  6.1.  Moreover,  it  may  be  assumed  that  the  failure  strain  depends  on  the 

diameter  of  the  tunnel;  that  is,  there  is  a  size  effect  which  may  be  a  regression 

relation  (could  be  nonlinear  between  the  mean  failure  strain  and  the  diameter  of  the 

tunnel),  as  illustrated  hypothetically  in  Fig.  6.2. 

Suppose  that  there  is  some  prior  information  on  the  failure  strain  of  tunnels  with 

given  diameter  d,  to  establish  its  probability  distribution  f  (e),  with  mean  u  and 

Ef  c  ^ 

a  .  Furthermore,  assume  that  a  is  known,  whereas  p  is  a  random  variable 
ef  ef  e{ 

with  prior  distribution  f^  (x) ,  whose  mean  value  is  u'  and  standard  deviation  o'. 

Then,  if  one  tunnel  of  diameter  d  is  tested  at  a  given  strain  e,  the  result  can 

be  used  to  revise  and  update  the  prior  probability  distribution  f^  (x)  as  follows: 

If  the  tunnel  survives  the  test  at  strain  e,  i.e.  >_  e,  the  posterior  distribtuion 

for  u  is, 
ef 
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Tunnel  Diameter  d 


Figure  6.2  Failure  strain  versus  tunnel  diameter 


f"  (x  |  e,  >_  c)  =  P(y  =  x  |  e  _>  e) 
n£  i  ef  t 


p(p  =  xf)Efi£) 


P(£f  >.  e) 


P(e,  >  c | y  =  x)P(y  =x) 

f  ~  cf _ c_f_ 

P(ef  >_  e) 


[1  -  F  (e  j  y  =  x)] 
-  f  f 

[1  -  F  (e) ] 


f’  (x) 


(6.1) 


whereas,  if  Che  tunnel  fails  at  the  test  strain  e,  i.e.  c  <  e  ,  then  the  corresponding 
posterior  distribution  for  y  would  be 


f"  (x  e  <  e)  =  P(y  =  x|e  <  e) 
y£  '  f  e£  f 


F.  (c|u  =  x) 
f  "f 

F  (c) 

Ef 


£’  (x) 

U 

e 


(6.2) 


The  updated  mean  and  standard  deviation  of  the  mean  failure  strain  then  become 


y 


If 


f"  (x)  dx 
y 

e 


(6.3) 


11/2 

f  (x)  dx  (6.4) 

ue  1 

6.3  Measure  of  Benefit  (i.e.  Information  Gain) 

Ordinarily,  there  may  be  prior  information  on  the  mean  failure  strain,  say  u', 
and  on  its  uncertainty  a'  (even  if  only  based  on  subjective  judgments). 

The  results  of  a  test  or  tests  can  then  be  used  to  update  the  prior  information 
to  obtain  the  posterior  mean  y"  and  associated  uncertainty  o".  The  benefit  of  an 
experiment,  therefore,  may  be  a  function  of  the  difference  between  y"  and  a"  relative 
to  the  prior  values  y'  and  o'.  Consequently,  in  defining  a  measure  to  represent  the 
benefit  of  a  test,  consider  the  i  lowing: 

If  no  tests  were  conducted,  the  prior  statistics  y'  and  o'  will  be  used  in  the 
planning  and  design  of  a  system.  Therefore,  if  there  are  large  errors  in  y'  or  in  o', 
the  loss  will  correspondingly  be  large,  whereas  if  these  priors  are  accurate  (i.e. 


50 


small  error),  the  resulting  loss  in  the  use  of  this  prior  information  directly  in 
design  will  be  correspondingly  small.  On  this  premise,  the  benefit  that  may  be  gained 
from  a  test  or  tests  can  be  defined  as  follows. 


U(u",j")  =  a(- 


P  ~  U 


-)2  + 


.2 


.-2 


.,2 


(6.5) 


where  p"  and  a"  are  the  updated  mean  and  standard  deviation  after  the  test  data  have 

been  obtained.  . 

o f  ^  —  on ^ 

The  second  term  in  Eq.  6.5,  ( - - —  )  ,  is  the  reduction  in  uncertainty  accruing 

m"  p'  2 

from  the  test  results;  whereas  the  first  term,  (— — — )  represents  the  reduction  in 
the  loss  that  would  otherwise  incur  had  there  been  no  tests  conducted. 

An  optimal  test  program  may  then  be  developed  to  obtain  the  maximum  potential 
benefit,  meaning  maximum  U. 

6.4  Optimal  Test  Strains 

Suppose  that  only  a  single  tunnel  is  to  be  tested  at  a  strain  level  e.  The 
question  then  is  "at  what  maximum  strain  e  should  the  tunnel  be  tested  in  order  to 
derive  the  maximum  benefit  from  the  test?" 

The  decision  variable  in  this  case  obviously  is  e .  If  a  tunnel  is  subjected  to  a 
test  strain  e,  two  possibilities  may  occur  —  the  tunnel  may  survive  the  strain  level 
c  (event  S) or  it  may  fail  (event  F  ).  The  simple  decision  tree  in  this  case  may  be 
represented  as  shown  in  Fig.  6.3. 


{p’j\<^  }  -  a}') 


(u'^  a j  )  -  U(p^,o-p 

s  =  (ef  >  e) 

F  =  (ef  <  e) 

Figure  6.3  Decision  tree  for  test  strain 

For  one  tunnel,  the  results  of  the  test  may  either  be  that  the  tunnel  survives  (S) 
or  fails  (F)  when  subjected  to  the  load  that  Induces  the  strain  e.  Depending  on 
whether  the  tunnel  survives  or  fails  at  the  strain  level  e  ,  the  corresponding  posterior 
mean  and  standard  deviation  may  be  denoted  as  (p£,  o'p  and  (p^,  a'p  .  The  corresponding 
utility  would  be  U(p^,  op  and  Ufpp  op  as  given  by  Eq.  6.5. 
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Hence,  the  expected  utility  of  a  test  at  strain  level  e  is 
E(u|e)  =  U(^,  op  •  P(ef  >_  c) 

+  U(Up  op  P(cf  <  e)  (6.6) 

in  which  the  posterior  quantities  u"  and  a"  may  be  obtained  through  Eqs.  6.3  and  6.4. 

The  optimal  test  strain,  e  ,  may  then  be  obtained  on  the  basis  of  the  maximum 

opt 

expected  utility  criterion,  as  follows: 


(6.7) 


This  means  that  if  a  single  tunnel  of  a  given  diameter  were  to  be  tested,  it  should 
be  placed  at  such  a  distance  from  the  blast  point  at  which  the  induced  maximum  strain 
is  e 

opt 

Generalizations  —  The  basic  procedure  described  above  for  determining  the  optimal 
test  strain  for  one  tunnel  can  be  extended  to  any  number  of  tunnels  of  the  same  diameter. 
Consider  first  the  case  of  two  tunnels. 

If  two  tunnels  are  to  be  tested,  the  strains  and  e ^  that  the  tunnels  should 
be  subjected  to  can  be  determined  also  on  the  basis  of  maximum  expected  utility,  in 
the  sense  of  Eq.  6.7. 

The  decision  tree  in  this  case,  is  as  shown  in  Fig.  6.4. 


(u'pa'p  -*  iKu’pct”) 
{p’pap  -  U(y”,ap 
{y’pop  -  UGipop 

K’°4}  "  U(V4*°4) 


Figure  6.4  Decision  tree  for  test  strains  of  two  tunnels. 
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As  shown  in  Fig.  6.4,  the  test  results  for  the  two  tunnels  could  conceivably  be  that 
both  tunnels  survived  or  one  of  them  survives  whereas  the  other  fails,  or  both  tunnels 
fail,  at  strains  e^  and  respectively;  i.e.  there  are  four  possible  outcomes,  t*^, 

02 ,  0^,  8^.  Corresponding  to  each  of  these  four  possible  outcomes  of  the  set  of  two 
tunnels,  the  resulting  posterior  means  and  standard  deviations  are  mV  and  o'^,  i  =  1,  2, 
3,  4. 

The  expected  utility  at  the  test  strains  and  £2  is,  therefore, 

E(U|c1,c2)  =  P(S1,S2)  +  U(m^  ,o’p  P(Sj,F2) 

+  U(m'^,  o")  P(F1,S2)  +  U(p”,o^)  P(F1,F2)  (6.8) 

where,  assuming  that  the  outcomes  between  the  two  tunnels  are  statistically  independent, 

P(S1,S2)  =  P(ef  ex)  P(cf  >.  e2) 

P(S1,F2)  -  P(ef  1  e^)  P(cf  <  c2) 

P(F1,S2)  =  P(cf  <  cx)  P(cf  1  r2) 

P(F1,F2)  =  P(cf  <  cx)  P(ef  <  £ 2> 

in  which; 

r . 

1 

P(c  <  e =  f£  (x)  dx 

0  f 

where  f  (x)  is  the  probability  density  function  of  the  failure  strain  as  depicted 
in  Fig.  £.1. 

And, 

P(cf  21  e^)  =  1  -  P(ef  <  Ej)  • 

Also, 


where: 


(x)  dx 

up2  f"(x)  dx;  j  =  1,  2,  3,  4 
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f'^(x)  «  P(pt  -  x|f.£  >  tL,  ef  >  e2) 

P(lJef‘  XH£f  >  El’  £f  >  e2> 

P(ef  >  El*  £f  "  C2) 

Assuming  Chat  the  outcomes  of  the  two  test  tunnels  are  statistically  independent, 

P(e£  i  e1l^c  =  x>  P(e£  1  e 2  I  M t-  =  x> 

f"(x)  =  - - - — -  f’(x) 

P(ef  >  tj)  P(ef  >  c2) 

where  f'(x)  is  the  prior  distrubtion  of  the  mean  failure  strain  u 


Similarly, 


P(e£  e£  | U£  =  x)  P(ef  <  e  |u£  =  x) 

f"(x)  =  - - - - -  f '  (x) 

P(e£  1  e^)  P(ef  <  c2) 


P(ef  <  e1|pe  =  x)  P(ef  _>  e2|pf  =  x) 

f’4(x)  =  - - - - -  f’(x) 

P(cf  <  tj)  P(ef  e2) 

P(ef  <  e1|u{.  =  x)  P(e£  <  e2|uc  -  x) 

fV(x)  »  - - - - - -  f(x) 

P(ef  <  c^)  P(c£  <  c 2 ) 

The  optimal  test  strains,  (e.,  c  )  ,  may  then  be  obtained  on  the  basis  of  the  maximum 

I  L  opt 


expected  utility  as  follows: 


3E(U|eiPc2) 


3E(u|e1,e2) 


The  generalization  of  the  above  formulations  to  n  tunnels  of  the  same  diameter, 

tested  at  strains  e^,  . En»  may  be  portrayed  with  the  decision  tree  shown  in 

Fig.  6.5  below. 


""<VV 

“"‘W 


Test  strains  {c^> 
for  n  tunnels 


Test  results: 
k  tunnels  survive 
9  tunnels  fail 
k  +  Z  =  n 


Posterior  estimate 
of  the  parameters 


Figure  6.5  Decision  tree  for  test  strains  of  n  tunnels 


The  expected  utility  then  may  be  represented  as. 


E  (U  |{  £  ± } )  =  I  P(0p  •  U(p^,o'’) 
all  i 


(6.10) 


in  which 


and. 


p(0.)  =  n  (P(e  >  E  )  •  n  P(c  <  f  ) 
iek  it! 


uV  =  x  f"  (x)  dx 

1  *0  lj 


ov  =  1  (x-w")  f"  (x)  dx 

1  1 0 


Thus,  the  utility  associated  with  outcome  6  is, 

py  2  °;2  -  -f 

u(iji*  "P  “  a<~Vp  +  <  -  > 

°i 

from  which  the  expected  utility  of  Eq.  6.10,  and  the  optimal  set  of  test  strains 

{t.t  „  may  be  obtained  from 
i  opt  1 
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0 


(6.11) 


Referring  to  Fig.  6.6  the  optimal  set  of  test  strains  is  obtained  at  node  B;  whereas, 
at  note  A,  the  optimal  number  of  tunnels  (of  the  same  diameter)  are  determined. 

The  above  analyses,  therefore,  would  yield  the  optimal  number  of  test  tunnels 
of  a  given  diameter,  and  a  corresponding  set  of  optimal  strain  levels  at  which  the 
various  tunnels  should  be  tested.  The  same  analysis  may  be  performed  for  tunnels  of 
other  diameters.  The  results  may  then  be  combined  with  those  of  the  other  test  tunnels 
(l.e.,  of  other  diameters)  to  obtain  the  expected  utility  (or  information  gain)  per 
unit  coat.  Repeated  evaluations  of  several  test  plans,  each  consisting  of  different 
combinations  of  tunnel  diameters,  should  provide  a  basis  for  identifying  the  optimal 


56 


test  plan  on  the  basis  of  maximum  expected  utility  per  unit  cost. 

Example 

The  concepts  formulated  above  are  difficult  to  illustrate  numerically;  much  of 
the  numerical  calculations  will  involve  extensive  numerical  integrations  making 
computer  calculations  necessary.  Nevertheless,  the  following  formulation  and  discussion 
may  serve  to  clarify  the  concepts  developed  herein. 

Suppose  the  diameter  of  full-size  prototype  tunnels  in  the  field  will  be  twenty 
feet.  Because  of  expense,  the  maximum  diameter  of  test  tunnels  may  have  to  be  limited 
to  ten  feet;  moreover,  the  results  obtained  for  10-foot  diameter  tunnels  may  be  con¬ 
sidered  to  approach  those  of  the  full-size  20-foot  tunnels.  Even  for  10-foot  tunnels, 
the  expense  of  testing  tunnels  of  this  size  may  be  so  high  that  the  number  of  such 
tunnels  must  also  be  limited;  the  use  of  smaller  diameter  tunnels  in  a  test  plan, 
therefore,  may  be  more  cost-effective. 

Consider  the  case  in  which  two  sizes  of  tunnels  are  to  be  used  in  a  test  program; 
namely,  tunnels  with  two-foot  and  ten-foot  diameters.  Furthermore,  assume  that  only 
two  10-foot  diameter  tunnels  will  be  tested,  whereas  a  larger  number  of  2-foot  tunnels 
will  be  economically  possible.  A  decision  on  the  strain  levels  at  which  the  various 
tunnels  should  be  tested  is  required  in  this  case. 

Consider  first  the  10-foot  tunnels,  at  test  strain  levels  and  c 2  for  the  two 
tunnels,  the  possible  results  or  outcome  from  the  tests  of  these  two  tunnels  will  be 
as  follows: 

(s1,s2),  (s1,f2),  (f1,s2),  (f1,f2) 

i.e.  one  or  both  of  the  tunnels  may  survive  at  the  specified  test  strains  and 
or  none  of  them  may  survive. 

Suppose  also  that  from  prior  observational  data,  supplemented  with  judgment, 
the  mean  failure  strain  of  ten-foot  diameter  tunnels  is  estimated  to  be  u',  and 
standard  deviation  o’.  However,  the  mean  failure  strain  contains  uncertainty;  hence, 
it  may  be  assumed  to  be  a  random  variable  with  normal  distribution;  i.e. 

f ’ (x)  =  (u *  ,  o') 

After  the  test  results,  the  posterior  statistics  of  the  failure  strain  may  be 
updated  or  revised  depending  on  which  outcome  is  realized.  For  example,  if  tunnel 
No.  1  survives  the  test  strain  whereas  tunnel  No.  2  fails  at  t~2,  i.e.  the  outcome 
is  (S^,F2),  then  the  posterior  statistics  for  the  mean  failure  strain  of  10-foot  tunnels 
are : 

u"  =  x  fj'(x)  dx 

o"2  -  (  (x  -  u")2  f ”(x)  dx 
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where 


f'J(x)  =  P(u  =  x|ef  >_  cv  cf  <  e2) 


Assuming  that  the  outcomes  between  the  two  tunnels  are  statistically  independent, 

p(Ef  1  S (n  *  *)  p(£f  <  t _ | u  =  x) 

f"(x)  =  - i ± - E - * -  •  f  '  (x) 

P(S1)  P(F2)  u 


e.-x  e  -x 

[1  -  *(-± - )]  H~~) 

a  a 

=  C£  Ef 

- i - 1 - f’(x) 

P(sx)  •  P(F2)  w 

in  which, 

e1  p' 

P (S.  )  =  P(e,  >£.)*1-  - ■> 

X  (2  2 

>o  +o 

Ef 

c2  V 

P(F  )  *  P(e  <  e  )  «  H-- . .  -> 

*  1  /2V  ,2 

o  +o 

V  cf 

and  4>(-)  is  the  cumulative  probability  of  the  standard  normal  distribution.  The 
posterior  distribution  f"  for  the  other  outcomes  can  be  similarly  determined. 

The  above  results,  of  course,  pertain  to  tunnels  with  10-foot  diameter.  For 
purposes  of  discussion,  suppose  that  the  above  analysis  yield  posterior  estimates  for 
the  mean  failure  strain  of  10-foot  tunnels  to  be 


and 


p"  =  2  x  10 
o"  =  0.4  x  10 


On  the  basis  of  the  results  for  10-foot  tunnels,  the  mean  failure  strain  for  the 
full-size  20-foot  diameter  tunnels  may  be  determined  as  follows: 


f f 20  “  A10  Cfl0 

where  A  is  the  bias  factor,  principally  to  take  account  of  anv  size  effect  (the  statis¬ 
tics  of  A  may  be  determined  from  prior  experience).  The  mean  failure  strain  for  Jo- 
foot  tunnels  then  may  be  expressed  as 
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Assuming  A  =  0.95  and  <5—  =  0.4, 

«=  0.95  (2x10"3)=1.90  x  10-3 

and  the  associated  c.o.v.  for  the  mean  failure  strain  of  20-foot  tunnels  is 


6 

u 


20 


+ 


(o'Vu") 


2 


+  0.2 


From  which. 


=  0.45 


o"  =  0.45  (1.90x10  3)  =  0.86  x  10 
”20 


At  this  point,  if  the  prior  information  is  and  °20*  w*1*c*1  may  be  results 
from  earlier  tests,  then  the  gain  in  information  (or  utility)  accruable  from  the 
test  of  the  two  10-foot  diameter  tunnels  becomes 


U 


u( 


20 


jv 


+  ( 


20 


20 


20 


20 


Observe  that,  implicitly,  U  is  a  function  of  t  ^  and  e Hence,  the  optimal  test 
strains  may  be  obtained  from  maximizing  U,  i.e. 


and, 

V-  =  o 

3c2 

The  results  would  be  the  optimal  strains  that  should  be  applied  to  the  two  10-foot 
tunnels.  Computer  calculations  will  be  necessary  to  obtain  these  results. 
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VII.  ANALYSIS  OF  A  RETALIATORY  SYSTEM 


7.1  Premise  of  Analysis 

Another  potential  application  of  statistical  decision  concepts  is  in  the 
evaluation  of  a  new  retaliatory  system,  such  as  the  MX  system  which  is  currently 
under  consideration  and  development.  (For  this  discussion,  consider  a  vertical 
shelter  configuration.) 

As  a  retaliatory  system,  the  principal  objective  is  to  insure  a  minimum  level  of 
retaliation  after  the  first  strike  by  a  potential  adversary.  In  the  case  of  the  MX 
system,  this  may  be  the  number  of  surviving  missiles  following  an  all  out  enemy  attack. 

The  number  of  missiles  that  may  be  estimated  to  survive  an  enemy  attack  will 
depend  on  the  following: 

(1)  The  number  of  shelters  in  the  total  system. 

(2)  The  probability  that  a  "loaded"  shelter  will  be  targeted  or  under  attack 
by  an  enemy  warhead . 

(3)  The  probability  that  a  missile  will  survive  if  it  is  under  attack. 
Mathematically,  the  number  of  surviving  missiles  may  be  expressed  as 

NSM  =  P(S|A)  P(A)  x  N 

m 

where : 

NSM  *  number  of  surviving  missiles  after  an  enemy 
attack; 

N  =  total  number  of  missiles  in  the  MX  system 
m  1 

P(A)  =  probability  that  a  "loaded"  shelter  will  be 
under  attack; 

P(S|A)  =  probability  of  a  missile  surviving  the  attack  A. 

The  probability  of  attack,  P(A),  will  depend  on  the  attack  strategy  deployed 
by  the  enemy;  i.e.  it  is  a  conditional  probability  P(AjD^),  where  is  the  strategy 
used  by  the  enemy,  such  as  the  "one-on-one"  or  the  "walk"  scheme.  In  the  "one-on-one" 
scheme,  the  enemy  attacks  with  one  warhead  for  each  shelter;  whereas,  in  the  "walk" 
scheme,  one  warhead  is  aimed  at  a  cluster  of  shelters. 

On  the  other  hand,  the  probability  of  survival,  P(S|A),  is  a  function  of  the 
hardness  and  the  weapon-placement  accuracy  of  the  enemy's  warhead.  Spacing  of  the 
shelters  may  also  be  relevant  if  the  enemy  deploys  the  "walk"  scheme;  however, 
spacing  may  not  be  pertinent  for  the  "one-on-one"  scheme. 

7. 2  The  Probability  of  Attack 

Depending  on  the  enemy's  arsenal,  there  may  be  more  than  one  option  for  his  attack 
strategy.  In  other  words,  there  may  be  several  potential  threats  to  a  retaliatory 
system.  Accordingly,  the  attack  probability,  P(A|D^),  will  further  depend  on  the  attack 
strategy  deployed  by  the  enemy;  moreover,  this  will  obviously  also  depend  on  the  quality 
of  the  enemy's  intelligence. 


The  One-on-One  Scheme  —  One  of  the  threats  to  the  MX  system  is  if  the  enemy 

chooses  to  deploy  one  warhead  per  given  shelter.  In  this  case,  the  probability 

that  a  "loaded"  shelter  will  be  under  attack  is  a  function  of  the  enemy's  intelligence 

and  the  number  of  successful  RV's  (re-entry  vehicles)  in  the  attack. 

For  the  one-on-one  thre.  the  probability  that  a  given  missile  mav  be  under 

attack  can  be  developed  as  follows: 

Let  N  =  number  of  successful  enemy  RV's  in  the  attack: 

e 

N  =  total  number  of  missiles  in  the  MX  system;  i.e.  the 
m  number  of  shelters  that  are  "loaded". 

N  =  total  number  of  vertical  shelters  in  the  system, 
s 

6  =  a  measure  of  enemy  intelligence,  expressed  in  terms  of 

a  fraction  of  loaded  shelters  that  are  known  to  or  can 
be  pin-pointed  by  the  enemy. 

Observe  that  if  the  enemy  has  0  intelligence,  and  chooses  to  deploy  the  one-and-one 

attack  strategy,  then  this  is  equivalent  to  targeting  the  missiles  at  random  among 

the  N  vertical  shelters.  This  means 
s 


s 


The  probability  of  attack,  if  the  enemy  uses  the  one-on-one  scheme,  therefore, 
is  given  by 


P(A 1 1/1)  -  B(N  /N  ) 
e  m 

For  example,  if  the  total  number  of  shelters  is  5,000,  and  among  these,  250  are  loaded, 
then  assuming  that  half  the  "loaded"  shelters  can  be  identified  by  the  enemy  then 


B  = 


250 

2,500 


1/10 


In  such  a  case,  the  probability  of  attack  becomes 

P(A | 1/1)  -  t~<N  /N  I  ). 

iu  e  m 

Therefore,  if  the  enemy  can  successfully  deliver  250  RV's,  the  probability  of  attack 
of  a  missile  is, 


P(A | 1/1)  -  0.1  x  250/250  -  0.10 

The  "Walk"  Scheme  —  In  the  case  of  the  "Walk"  scheme,  the  enemy  would  presumably 
aim  hia  RV's  at  a  cluster  of  shelters;  that  is,  several  (e.g.  '3  or  4)  shelters  mav 
be  subject  to  the  overpressure  from  a  single  weapon  burst. 
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Again,  suppose  that  there  is  a  total  of  N  shelters,  among  which  N  are  "loaded". 

s  m 

For  purposes  of  this  discussion,  assume  that  the  shelters  are  arranged  in  clusters  of 
4  in  a  square  as  shown  in  Fig.  7.1. 

O  o 


shelter 


Figure  7.1  Cluster  of  four  shelters  in  a  "square" 


Then, 


N 

=  proportion  of  the  total  number  of  shelters  that  the 
s  enemy  estimates  are  loaded. 


For  a  given  target  point  or  "square",  the  number  of  corners  in  the  square  that  the 
enemy  would  estimate  are  loaded  can  be  described  as  follows: 


.  N  N  . 

p(y  =  y>  =  (4)  (— V  (i  -  _ 

K  y>  y  'bn  '  v  bn  1 

s  s 


in  which,  Y  *  the  number  of  corners  that  the  enemy  will  think  are  loaded.  The 
expected  value  of  Y  is 

E(Y)  =  A(^) 
s 

Let  X  =  number  of  corners  that  arc-  actually  loaded.  The  expected  value  of  X  is, 

N 

E(X)  =  SE(Y)  =  4(-j— ) 


Obviously,  the  enemy  can  be  expected  to  target  only  those  squares  that  have  at  least 
one  loaded  shelter;  i.e.  Y  1.  On  this  assumption,  the  number  of  missiles  in  a  square, 
however,  may  be  estimated  only  in  terms  of  its  probability  as  follows: 


y|v  i  i)  -  p(y£T)'  Ky>  -  jfy  y’;  for  y  - 


whereas,  the  corresponding  expected  value  of  Y  is 

4 

E(Y|Y  1  1)  -  [  y  P(Y  -  y | Y  1) 

y-i 

Alternatively,  of  course,  the  enemy  may  target  only  those  squares  with  Y  2,  or 
Y  >  3,  or  only  those  squares  with  Y  «  4.  In  these  latter  cases,  the  corresponding 
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conditional  probabilities  and  expected  values  of  Y  are  respectively  as  follows: 
For  squares  with  Y  _>  2: 


P(Y  =  y|Y  >  2)  =  —A—  6  (^)y  (1  -a^r") 


4-y 


P(Y  >_  2)  y  'BN 


BN 


y  ^  2 


E(Y|Y  1  2)  =  l  y  P(Y  =  y  |  Y  >_  2) 
y=2 


For  squares  with  Y  3: 

P  (Y  =  y  | Y  >_  3)  = 


i  .  N  y  N  4-y 

- i -  A  (_“)  (1 _ SI) 

P(Y  >  3)  V  'BN  '  u  6Ng' 


E(Y  |  Y  >  3)  »  j  y  P(Y  =  yjY  >_  3) 
y=3 


y  1  3 


Finally  for  squares  with  Y  «  4: 


P(Y  =  4 | Y  =  4)  =  1.0 
E(Y|Y  -  4)  =  4.0 


The  conditional  expected  value  of  X,  therefore,  is 


E(x|y  i  y)  »B  E(y|y  >.  y);  y  =  1,2, 3, 4 


Again,  assuming  that  each  cluster  consists  of  four  shelters  in  a  square,  the 
number  of  squares  in  the  entire  MX  system,  therefore,  is 


N 

s 

4 

then,  the  number  of  squares  with  (Y  ^  1)  is, 

N 

Y  1  -  [P(Y  -  1)  +  P(Y  -  2)  +  P (Y  -  3)  +  P(Y  *  4))  x  j- 

N 

(Y  >  2)  ■  [P(Y-2)  +  P(Y-3)  +  P(Y-4) ]  ~ 

N 

(Y  >  3)  -  (P(Y-3)  +  P(Y-4)}  -f 


N 

(Y  -  4)  -  P(Y«4)  x 


Number 

of 

squares 

with 

Number 

of 

squares 

with 

Number 

of 

squares 

with 

and , 

Number 

of 

squares 

with 

total 
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r 


Therefore,  if  the  enemy  targets  only  those  squares  with  (Y  >_  2) ,  the  number  of 
successful  RV's  needed  by  the  enemy  will  be 


N 

[P(Y=2)  +  P(Y=3)  +  P(Y=4)]  ~r  ■ 

4 


In  which  case,  the  number  of  loaded  shelters  that  will  be  under  attack  will  be 


E(X|Y  2.  2)  x  (No.  of  squares  with  Y  >_  2) 


Hence,  the  probability  that  a  given  missile  will  be  under  attack  becomes 


P(A|Walk) 


E(X|Y  >_  2)  x  (No.  of  squares  with  Y  _>  2) 


N 

m 


Numerical  Example 

Suppose  that  the  MX  system  consists  of  4600  shelters,  among  which  200 
of  them  are  loaded  with  intercontinental  missiles.  It  is  probably  reasonable  to 
assume  that  this  information  is  known  by  the  enemy.  However,  at  any  given  time,  the 
enemy  would  not  know  exactly  where  the  200  missiles  may  be  placed  among  the  4600 
shelters.  To  illustrate  the  above  model,  assume  that  the  intelligence  measure  of 
the  enemy  is  6  =  1/10.  In  this  case,  6'=  ^q'q  “  would  be  equivalent  to  the  absence 
of  intelligence  information. 

Therefore, 


m  200 


SN  0.1x4600 
s 


=  0.435 


from  which. 


P(Y  =  0)  =  (0.435)°  (l  -  0.435)4  =  0.102 


P(Y  =  1)  =  4(0.435)  (1  -  0.435)  =  0.314 


P(Y  -  2)  =  (0.435)2  (1  -  0.435)2  =  0.362 


P (Y  =  3)  -  |r  (0.435) ^  (1  -  0.435)  -  0.186 


P(Y  -  4)  -  (0.435)  -  0.036 


4600 


Then,  the  number  of  squares  with  Y  >_  1  «  (1  -  0.102) — £ —  «  1033. 
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S  Lmilarly ; 


number  of  squares  with  (Y  _>  2)  =  (1  -  0.102  -  0.314)  1150  =  672 
number  of  squares  with  (Y  >  3)  =  (0.186  +  0.036)  1150  =  255 


number  of  squares  witli  (Y  =  4)  =  0.036  x  1150  =  41. 

Also , 

E(Y|Y  >  1)  =  (1  x  0.  314  +  2  x  0.362  +  3  x  0.186  +  4  x  0.036) 

■  u . 


r. (Y  I Y  >  2)  =  2.44 
K (Y  |  Y  >  3)  =  3.16 
E(Y|Y  =  4)  =  4.00 

and , 

K (X | Y  >  1)  =  1/10  x  1.94  =  0.194 
K(X|Y  >  2)  =  1/10  x  2.44  =  0.244 


K(X | Y  >  3)  =  1/ 10  x  3.16  =  0.316 
K(X|Y  =  4)  =  1/10  x  4.00  =  0.400 

The  last  four  figures  given  above  are  simply  the  expected  number  of  corners  with  Y  >  1, 
Y  2.  -i  Y  _>  3,  and  Y  =  4;  these  will,  of  course,  not  be  realized  in  practice. 

If  the  enemy  chooses  to  attack  only  those  squares  containing  at  least  two  loaded 
shelters,  i.e.  Y  2,  then  the  number  of  RV's  that  he  must  successfully  deploy  is 
672.  If  the  enemy  has  this  capability,  the  expected  number  of  missiles  in  the  MX 
system  that  will  be  under  attack  is  672  x  0.244  =  164.  Therefore,  the  proportion 
among  the  200  missiles  that  will  be  under  attack  is 

P  (A  |  walk )  =  =  0.82 

On  the  other  hand,  if  the  enemy  has  limited  capability,  say  less  than  300  RV's, 
he  may  have  to  target  only  those  squares  in  the  MX  system  with  Y  >  3;  according  to 
the  above  calculations,  the  number  of  successful  RV's  required  in  this  case  will 
be  255.  Accordingly,  the  expected  number  of  missiles  In  the  MX  system  that  will 
be  under  attack  Is  255  x  0.316  «  81.  Then,  the  proportion  of  the  active  missiles 
that  will  be  under  attack  is 
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P(A|walk) 


81 

200 


0.41 


7 . 3  On  the  Probability  of  Survival 

To  achieve  or  insure  a  given  survival  probability,  the  necessary  hardness 
(e.g.  expressed  in  terms  of  overpressure  capacity)  must  be  designed  into  the 
individual  shelter,  as  well  as  of  its  components  such  as  the  shock  isolation 
system  and  the  closure.  The  required  hardness  may  be  determined  by  applying 
the  appropriate  "factor  of  safety"  which  is  simply  a  factor  that  may  be  used  to 
amplify  the  weapon-induced  overpressure  in  order  to  determine  the  required  over 
capacity  of  the  shelter  for  insuring  the  desired  survival  probability.  For  this 
purpose,  the  appropriate  relationship  between  survival  probability  and  the  median 
safety  factor  ought  to  be  developed,  which  may  be  in  the  form  shown  schematically 
in  Fig.  7.2. 


Survival  Probability,  P(sjA) 


Figure  7.2  Safety  factor  versus  survival  probability 

That  is,  hardness  may  be  defined  as  the  median  overpressure  capacity  of  a  shelter 
or  of  its  associated  components.  In  this  context,  it  is  a  function  of  the  desired 
probability  of  survival  against  a  given  overpressure  environment,  as  indicated  in 
Fig.  7.2.  Conversely,  the  probability  of  survival  is  a  function  of  the  relative 
positions  between  the  PDF  (probability  density  function)  of  the  applied  overpressure 
and  the  PDF  of  the  overpressure  capacity  of  a  shelter  as  shown  in  Fig.  7.3.  The 
relative  positions  may  be  given  in  terms  of  the  ratio  of  the  median  overpressure 
capacity  to  the  median  of  the  applied  overpressure,  R/S,  which  may  be  called  the 
median  safety  factor.  Additionally,  the  survival  probability  is  also  a  function 
of  the  uncertainty  in  the  applied  environment  as  well  as  in  the  overpressure 
capacity  of  the  shelter.  The  uncertainties  in  the  applied  overpressure  will 

Include  those  associated  with  the  following  factors: 
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inaccuracy  in  the  overpressure-distance  relationship; 

error  in  the  estimation  of  the  coupling  effect; 

the  determination  of  the  effect  of  the  height-of-burst ; 

the  analysis  of  the  effect  of  soil-structure  interaction,  among  others. 

The  uncertainties  in  the  overpressure  capacity  of  a  shelter  facility  will 
include  those  arising  from  the  following: 

inaccuracy  in  the  determination  of  the  strength  of  vertical  concrete/steel 
cylinder; 

the  determination  of  the  strength  of  closure; 

the  determination  of  the  capacity  of  a  shock  isolation  system; 
error  in  the  calculation  of  the  response,  etc. 

7 . 4  A  Trade-Of f  Prob lem 

If  a  missile  is  under  attack,  its  probability  of  survival  under  a  "one  on 
one"  threat  would  be  lower  than  that  under  a  "walk"  scheme.  On  the  other  hand, 
for  the  same  number  of  RV's  and  the  same  enemy  intelligence,  the  probability  that  a 
missile  will  be  under  attack  is  correspondingly  lower  with  the  "one  on  one"  scheme  com¬ 
pared  with  the  "walk"  scheme,  especially  if  the  shelters  are  spaced  sufficiently  far 
apart. 

In  short,  the  number  of  surviving  missiles  is  a  function  of  the  probability  of 

attack  as  well  as  the  conditional  probability  of  surviving  an  attack.  If  there  is  a 

relative  likelihood  that  the  enemy  may  employ  one  or  the  other  schemes,  say  with 

probabilities  p  and  p  in  the  case  of  the  "one-on-one"  and  "walk"  schemes  such  that 
ro  w 

p  and  p  =1.0,  then  the  expected  NSM  is 
o  w 

NSM  ={[I  -  P(f|a  )  P (A  )]  p  +  [1  -  P(F|A  )  P(A  ))  p  }  N 
o  o  o  w  w  w  m 

where  P(f|a)  =  conditional  probability  of  failure  given  an  attack. 

In  the  case  of  the  "one-on-one"  scheme  the  probability  of  attack  P(A  )  is  simply 

a  function  of  N^,  and  6*  Implicitly,  this  probability  is  also  a  function  of  the 

total  number  of  shelters  N  ,  since  8  is  implicitly  a  function  of  N  . 

S  s 

The  probability  of  survival  of  a  given  missile,  once  it  is  under  attack,  will  be 

fairly  low  in  the  case  of  the  "one-on-one"  attack  scheme.  Its  chance  of  surviving  an 

enemy  RV  will  be  largely  due  to  the  weapon  inaccuracy,  i.e.  the  CEP.  Furthermore,  the 

probability  of  survival,  P(s|A  ),  will  not  be  affected  by  the  spacings  between  the 

shelters  if  the  enemy  deploys  the  "one-on-one"  scheme. 

However,  in  the  case  of  the  "walk"  scheme,  the  probability  of  attack  P(A  ),  is  a 

w 

function  of  N  ,  N  ,  N  ,  and  B.  That  is, 
e  m  s 
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P(A  )  =  g(N  ,  N  ,  N  , g  ) 
w  e  m  s 

Increasing  the  number  of  shelters,  Ng,  will  decrease  P(Aw>,  and  thus  increases  the 

NSM;  this  would  mean  using  closer  spacings  among  the  shelters  for  the  same  land  area 

or  the  same  total  cost.  However,  for  a  given  shelter  hardness,  the  probability  of 

survival,  of  course,  will  decrease  with  closer  spacings.  The  spacing  may  be  increased 

by  using  fewer  shelters;  this,  will  have  the  effect  of  increasing  P(A  )  but  will  also 

w 

increase  P(s|Aw>. 

Increasing  the  hardness  of  the  shelter  (i.e.  increasing  its  overpressure  capacity) 

against  a  given  weapon  yield  will  also  increase  the  probability  of  survival  P(s|A  ), 

w 

In  otherwords,  in  the  case  of  the  "walk"  scheme,  there  is  an  optimal  expected  NSM 

between  using  large  spacing  (say  9,000  ft)  with  fewer  shelters  vs.  using  closer  spacing 

(say  4,000  ft)  with  correspondingly  larger  number  of  shelters  and  higher  level  of  shelter 

hardness.  That  is,  for  the  "walk"  scheme,  increasing  the  hardness  as  well  as  increa  ;ing 

the  spacings  between  shelters  will  increase  the  survival  probability;  however,  within 

a  finite  land  area  or  limited  budget,  larger  spacings  between  the  shelters  will  mean 

smaller  number  of  shelters  and  thus  increasing  the  probability  of  attack  P(A  )  of  the 

w 

missiles. 

In  this  light,  there  is  therefore  a  trade-off  between  spacing  and  hardness  in  order 
to  achieve  the  maximum  possible  NSM  (the  number  of  surviving  missiles),  within  a  given 
total  budget  for  the  MX  system.  In  order  to  perform  such  a  trade-off  study  between  these 
major  factors,  the  necessary  information  must  be  developed.  Such  information  would  in¬ 
clude  the  following. 

(1)  The  relationship  between  the  probability  of  attack  P(A  )  as  a  function  of 

w 

the  variables  Ne>  N^,  ,  and  8.  These  relationships  may  then  be  used  to  develop  the 

relationships  of  P(A^)  as  a  function  of  the  shelter  spacing. 

(2)  In  the  case  of  the  probability  of  survival,  P(s|Aw),  increasing  the  spacing 
between  shelters  will  correspondingly  decrease  the  overpressure  on  a  given  shelter  for 
a  given  weapon  yield,  and  thus  increases  the  probability  of  survival.  Also,  increasing 
the  hardness  of  a  shelter  will  increase  the  survival  probability  since  the  capacity  of 

the  shelter  to  withstand  the  applied  overpressure  will  correspondingly  increase.  These 

relationships  should  also  be  developed. 
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APPENDIX  A:  EVALUATION  OF  UTILITY  IN  A 
DECISION  TREE 


The  procedure  is  as  follows: 

(i)  Rank  the  events  involved  in  the  decision  tree  in  order  of  preference 

such  that  E^  >  E^  >  . . •  >  E^;  where  the  symbol  >  means  "is  preferred  to. 

(ii)  Assign  (E^)  =  1.0  and  u(En)  =  0* 

(iii)  Establish  the  value  of  p  such  that  the  decision  maker  is  indifferent 


thus,  u(E2)  =  pu(E^)  +  (1-p)  u(E^) 


=  P 

(iv)  Repeat  step  (iii)  n-3  times  with  E^  replaced  each  time  by  E^,  ...»  E^  ^ 
respectively;  the  value  of  p  obtained  for  different  E^  will  generally 
be  different. 

(v)  At  this  step,  a  set  of  utilities  for  E^,  E2 ,  ...,  has  been  determined. 

However,  to  provide  cross  checking  on  these  values,  one  may  start  another 

series  of  inquiry  using  u(E,)  and  u(E  ,)  as  the  set  of  new  reference 

i  n-I 

points  and  determine  a  new  utility  value  for  E^,  namely  u'(E2),  such  that 
indifference  is  achieved  as  follows: 


where  u' (E0)  =  p  u(E.)  +  (1-p)  u(E  ,) 
L  l  n-i 

=  p  +  (1-p)  u(E  ) 
n-i 


If  the  utility  values  are  consistent,  u*(E2)  should  be  equal  to  uCE^) 
as  determined  earlier  in  step  (iii). 

(vi)  Repeat  step  (v)  n-4  times  with  E^  replaced  each  time  by  E^,  E^  2> 

respectively. 

If  any  inconsistencies  are  found  in  the  above  procedure,  repeat  the  questioning  process 
until  all  utility  values  agree  satisfactorily. 
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APPENDIX  B:  DETERMINATION  OF  UTILITY  FUNCTION 


A  procedure  for  determining  a  utility  function  is  as  follows: 

(i)  Identify  the  range  of  values  of  the  specific  attribute  covered  by  the 
decision  analysis. 

(ii)  Assign  utility  values  of  1.0  and  0,  respectively,  to  the  utilities  of 
the  two  extremes. 

(iii)  The  utility  of  an  intermediate  value  of  the  attribute  may  be  determined  from 
the  value  of  X  such  that  indifference  is  achieved  by  the  decision  maker 
on  the  following  lotteries: 


=  0.5  x  1  +  0.5  x  0  =  0.5 


where  X*  and  *X  are,  respectively,  the  extreme  values  of  the  atributes 
with  utilities  1.0  and  0,  respectively. 


(iv) 


Repeat  step  (iii)  by  replacing  X*  by  X^;  the  utility  of  another  value 
of  the  attribute  could  be  obtained  from  the  following  pair  of  indifferen 
lotteries: 


=  0.5  x  0.5  +  0.5  x  0  =  0.25 


(v)  Repeat  the  above  procedure  by  varying  the  values  of  the  attribute  on  the 
50-50  lottery,  to  obtain  the  utilities  for  other  attribute  values. 

(vi)  Plot  the  utility  values  as  a  function  of  the  attribute  value  X,  and  fit 
a  curve  through  the  set  of  points.  The  resultant  curve  is  the  utility 
function. 
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